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METHODS FOR THF T RANSFORMATION OF VEGFT4L PLASTIDS 

The present invention relates to novel methods for the generation 
of transgenic plants with genetically edified plastids^nd to 
the transgenic plants generated using these methods. 

Biotechnological work carried out on plants aims at generatino 
Plants with advantageous novel properties, for example to 
fooH e , aS ^ agriC " ltUral P roductiv ^y, to increase the quality i n 
foodstuffs or for producing certain chemicals or pharmaceuL^ls . 

Plastids are organelles within plant cells which have their own 
genome. They play an essential role in photosynthesis and in 
amino acid and lipid biosynthesis. The plastids' genome consists 

120 \TTs l B T*T d ' CirCUlar ° NA With &n aVe "* e 8i - of from 
120 to 160 kb and is present - for example in leaf cells -as 

approximately 1900 to 50 000 copies per cell (Palmer ,1985) Ann 
Rev Genet 19:325-54). A single plastid has a copy number o 
approximately from 50 to 100. The term plastids combes 
chloroplasts, proplastids, etioplasts, chromoplastsfamjloplasts 

82 e -65? 66M V, elai ° Pla8tS < Heif ^ » (2000, P Biochimie y P 

82 655-666 . The various forms can be converted into one another 

and all arxse from the proplastids. This is why all manifested 

Prince i ^ — informati" 

Preference 16 given in the literature as starting material for 
the transformation of plastids, however, to green Zlls Llll 
comprise the chloroplasts as the manifested forT ' * 

develof Jr at T n ° m±C intSreSt f ° r Plant technologists to 
Tt 2 efflC16nt methods for the transformation of plastids 
(McFadden G (2001) Plant Physiol. 125:50-53). The stable 

^S^T * °* iS ~ « ^at 

in the transformation of plastids, the technique of undirected 
(illegitimate) DNA insertion, which is frequently employed in 

Smv T° nUCl6ar DNA ' haS the d -^anLge P thIt it is 
highly likely that an essential gene on the gene-dense plastidL 

IZT tl ^ Wh±Ch W ° Uld fregU6ntl * be leth' for the ' 
Plant The directed insertion of foreign DNA is therefore 
advantageous in plastids. 

various methods for the directed insertion into the plastidic 
genome have been described. The first to be described was plastid 
transformation in green algae (Boynton JE et al a9 88 7 
240: 1534 . 15 38; Blowers AD et al.^SS) Pl^ ieU "L^! 
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followed later by higher plants such as tobacco (Svab Z et al. 
(1990) Proc Natl Acad Sci USA 87:8526-8530). 

EF-A 0 251 654, US 5,932,479, US 5,451,513, US 5,877,402, 
5 WO 01/64024, WO 00/20611, WO 01/42441, WO 99/10513, WO 97/32977, 
WO 00/28014, WO 00/39313 describe methods and DNA constructs for 
the transformation of plastids of higher plants, where the DNA to 
be transformed is introduced into the plastome (plastidic genome) 
via homologous recombination ("double crossover"). In general, 

10 homologous regions of 1000 bp or more on either side of the 

sequence to be inserted are employed. This rapidly gives rise to 
large vectors whose handling is not very convenient. Moreover, 
the transformation efficiency drops. The homologous recombination 
efficiency drops with the increasing length of the foreign DNA to 

15 be integrated. A further disadvantage is the fact that a 

homologous region which can be utilized for the process of DNA 
integration by means of double crossover must be identified for 
each plant species. WO 99/10513 claims the identification of an 
intergenic DNA sequence with supposedly sufficient homology 

20 between the genomes of the chloroplasts of many higher plants, 
which DNA sequence can thus act as a universal target sequence. 
However, it has not been demonstrated that this vector can be 
utilized successfully in species other than tobacco; rather, in 
WO 01/64024, the same inventor adapts the transformation vector 

25 to non-tobacco plant species by using homologous DNA sequences 
isolated from these plant species. Since only few recombination 
events result in all of the above-described methods, selection of 
the recombinant plastidic DNA molecules is required. 

30 The plastid DNA of higher plants is present in the form of up to 
several thousand copies per cell. To ensure stable integration of 
foreign DNA, all copies of the plastidic DNA must be modified in 
the same manner. In plastid transformation, this is referred to 
as having reached the homotransplastomic state. This state is 

35 achieved by what is known as a segregation-and-sorting process, 
by exerting a continuous selection pressure on the plants. Owing 
to the continual selection pressure, those plastids in which many 
copies of the plastidic DNA have already been modified are 
enriched during cell and plastid division. The selection pressure 

40 is maintained until the homotransplastomic state is reached (Guda 
C et al. (2000) Plant Cell Reports 19:257-262). The modification 
of all of the copies of the plastidic genome in order to obtain 
homotransplastomic plants which have incorporated the foreign 
gene stably into their plastidic genome over generations without 

45 addition of a selection agent is a great challenge (Bogorad L 
(2000) TIBTECH 18:257-263). In addition to the continuous 
selection pressure, achieving the homotransplastomic state is, if 
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appropriate, ensured by repeatedly regenerating tissue which has 
already been transformed (Svab Z and Maliga p (1993) Proc Natl 
Acad sci USA 90:913-917). However, this procedure limits the 
plant material which is available for plastid transformation. 
5 Coupling, if appropriate, the transgene with another gene which 
is essential for the survival of the plant is therefore proposed. 
In most cases, tissue culture techniques and selection processes 
cannot be applied universally to all plant species and constitute 
a substantial limitation of plastid transformation, in particular 
10 with regard to the applicability of the method to species other 
than tobacco (Kota M et al. (1999) Proc Natl Acad 
Sci USA 96:1840-1845). A recently published transformation of 
tomato plastids is based on modifications in the regeneration and 
selection scheme (Ruf s et al. (2001) Nature Biotech 19:870-875), 
15 which, however, are expensive and time-consuming. Another 

approach aims at reducing the number of plastids per cell and the 
DNA molecules per plastid so that fewer DNA molecules have to be 
modified (Bogorad L (2000) TIBTECH 18:257-263). All of the 
selection and segregation processes are very time-consuming. 



20 



wo 99/10513 describes a method in which a plastidic ORI (origin 
of replication) is localized on the plasmid to be transformed in 
order to increase, in this manner, the number of copies of the 
vector to be transformed which are available for integration into 
25 the plastidic genome (Guda C et al. (2000) Plant Cell Reports 
19:257-262). 

The necessity of improving the plastid transformation technique 
xs also mentioned in Heifetz and Tuttle (Heifetz P and Tuttle AM 

30 (2001) Curr Opin Plant Biol 4:157-161). WO 00/32799 teaches 

increasing the efficiency of plastid transformation by employing 
plants with enlarged plastids. This results in a large plastid 
surface, through which the DNA to be transformed can enter the 
plastids with greater ease. However, the mechanism of DNA 

35 integration relies, again, on conventional homologous 

recombination, as was the case in the above-described methods. 

A variety of other methods for the sequence-specific integration 
of DNA - xn particular into the nuclear DNA - have been 

40 described. A method based on self -splicing group II introns has 
been described. Self -splicing group II introns are capable of 
inserting in a sequence-specific fashion, for example into 
intron-free genes. The sequence-specific hydrolysis of the target 
DNA is catalyzed by an RNA-protein (ribonucleoprotein) complex. 

45 Here, the sequence specificity of the endonuclease function is 
determined in particular by base pairings being formed between 
the RNA moiety of the ribonucleoprotein complex and the target 
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DNA. The use of group II introns as vectors for foreign DNA has 
been discussed. By modifying certain sequences of a group II 
intron, it was possible to modify the target specificity of the 
latter. Also r it was possible to insert further sequences into 
5 group II introns without destroying functions of the latter (Yang 
J et al. (1996) Nature 381:332-335; Eickbush TH (1999) Curr Biol 
9:R11-Rl4; Matsuura M et al. (1997) Genes Develop 11:2910-2924; 
Cousineau B et al. (1998) Cell 94: 451-462). The adaptation to 
certain target sequences and the determination of the associated 

10 rules / however , is laborious and has as yet been elucidated in 
detail only for the Ll.ltrB intron (Mohr G et al. (2000) Genes 
Develop 14:559-573). Moreover, the retrohoming efficiency was 
reduced significantly by the modification, and not every single 
one of the modified introns tested inserted into the desired 

15 target DNA. The disadvantage of the technique is that some 

positions in the nucleotide sequence are fixed, which limits the 
choice of the target region in the DNA to be transformed (Guo H 
et al. (1997) EMBO J 16:6835-6848). Moreover, the efficiency of 
the retrohoming process with regard to that of the wild-type 

20 intron appears to be diminished. The efficiency of intron 

insertion at different sites on the genes investigated differed 
with regard to its level. The work aimed at providing an improved 
method for the directed insertion of DNA into the nuclear DNA of 
organisms which permit no efficient homologous recombination (Guo 

25 et al. (2000) Science 289:452-456). The experiments described 
have been carried out extrachromosomally both in the prokaryote 
E.coli and in human cells. The applicability to the chromosomal 
DNA of higher organisms or the applicability to plastidic DNA was 
neither described nor demonstrated. It was merely proposed to 

30 attempt the optimization of this system in such a way that 
insertion into chromosomal DNA of higher eukaryotes can take 
place. This system is supposed to be an alternative method for 
higher eukaryotes which lack efficient homologous recombination 
(Guo et al. (2000) Science 289:452-456). This does not apply to 

35 plastids of higher plants, where homologous recombination - at 
least in the case of individual plastidic DNA molecules — can 
usually be performed without problems. 

Plastid transformation was demonstrated not only in tobacco, but 
40 also in potato (Sidorov VA et al. (1999) Plant J 19:209-216; WO 
00/28014), petunia (WO 00/28014), rice (Khan MS and Maliga P 
(1999) Nature Biotech 17:910-915; WO 00/07431; US 6,153,813), 
Arabidopsis (Sikdar SR et al. (1998) Plant Cell Reports 18: 
20-24; W0 97/32977) and oilseed rape (WO 00/39313). (Review 
45 article: Bogorad L (2000) TIBTECH 18:257-263). Transplastomic 
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tomato plants have also been described recently (Ruf S et al. 
(2001) Nature Biotech 19:870-875). 

The generation of sequence-specific double-strand breaks with the 
5 aid of restriction enzymes in eukaryotic genomes, including 
plants, has been described (Puchta H (1999) Methods Mol Biol 
113:447-451). 

WO 96/14408 describes the homing restriction endonuclease I-Scel 
10 and various possibilities for its use. An application for 
inserting DNA sequences into plastidic DNA is not described. 

Posfai et al. describe a method for the substitution of genes in 
the prokaryote E.coli (Posfai G et al. (1999) Nucleic Acids Res 

15 27(22) :4409-4415) . Here r an intramolecular recombination between 
the endogenous and the mutated gene takes place in the E.coli 
genome, which combination is induced by cleaving with the 
restriction enzyme I-Scel. Recombinations in E.coli proceed 
markedly more efficiently and, presumably, following different 

20 mechanisms than is the case in the nucleus of higher eukaryotes 
(for example described by Kuzminov A (1999) Microbiol Mol Biol 
Rev. 63(4) : 751-813 ) . 

"Homing" refers to the phenomenon that two or more copies of a 
25 DNA sequence exist in one compartment, where at least one of 

these two sequences is interrupted by a further DNA sequence, and 
a copy of the interrupting DNA sequence is subsequently also 
introduced into the noninterrupted DNA sequence. This phenomenon 
usually takes the form of intron homing. Here, two or more 
30 alleles of one gene exist in one compartment, where at least one 
of these alleles has no intron. A copy of the intron is 
subsequently also introduced into the intron-free allele. 

Introns in plastidic genes of higher plants have been described 
35 (Vogel J et al. (1999) Nucl Acids Res 27:3866-3874; Jenkins BD 
et al. (1997) Plant Cell 9:283-296; Xu MQ et al. (1990) Science 
250: 1566-1570). The splicing of a homologous, unmodified intron 
with the natural exon regions at an ectopic locus in the 
plastidic genome has likewise been described (Bock R and Maliga P 
40 (1995) Nucl Acids Res 23(13) :2544-2547) . Experiments of 

introducing, into plastids of higher plants, heterologous introns 
which are additionally modified in such a way that they comprise 
additional genetic information and/or splice in a nonnatural 
sequence environment have not been carried out as yet. 
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Experiments carried out by Eddy and Gold into the homing process 
in E.coli have demonstrated that certain recombination systems 
are required. The type of the recombination system of the host is 
a key variable (Eddy SR and Gold L (1992) Proc Natl Acad Sci USA 
5 89:1544-1547). It was therefore impossible to assume that the 
naturally occurring homing process of one organism can be applied 
at will to another organism, in particular when this process 
probably does not occur naturally in the latter organism. 

10 DUrrenberger et al. describe the induction of an intrachromosomal 
recombination in chloroplasts of the single-celled green alga 
Chlamydomonas reinhardtii using the I-Crel homing endonuclease 
(DUrrenberger F et al. (1996) Nucleic Acid Res 24(17 ): 3323-3331 ) . 
The recombination takes place between the endogenous 23S gene and 

15 a 2 3S-cDNA which is inserted into the chromosome of an I-Crel 
deletion strain and which comprises an I-Crel cleavage site. 
Double-strand breaks are induced by mating the relevant 
transgenic organism with an organism which naturally expresses 
I-Crel. At the point in time of the double-strand break, the 

20 foreign DNA is already inserted into the chromosomal DNA, and 
recombination takes place intramolecularly and not between two 
separate molecules. 

It has been shown recently that a mobile intron which naturally 

25 occurs in Chlamydomonas reinhardtii and which also encodes a 
homing endonuclease can be transformed efficiently into an 
intron-free copy (Odom OW et al. (2001) Mol Cell Biol 21: 
3472-3481). In this work, the increase of the transformation rate 
was dependent on the presence of the homing endonuclease. In the 

30 discussion, it is proposed in general terms and without specific 
suggestions regarding the implementation, to improve plast id 
transformation by inducing double-strand breaks. To this end, the 
recognition regions of rare nucleases were initially to be 
introduced in a first step, and the subsequent integration event 

35 was then to take place at the same locus. More detailed 

suggestions regarding the manner in which the recognition regions 
are to be introduced, the type of nucleases and recognition 
regions which can be used, the way in which the first step and 
the second step can be designed in actual reality, and the like, 

40 are not provided. All that has been shown to date is that the 
introduction of a homologous intron, into plast ids of the alga 
Chlamydomonas, by means of the homing endonuclease naturally 
associated with the mobility of the intron did work. Moreover, 
the results were generated in an algal species. The 

45 abovementioned experiments by Eddy and Gold with E • coli, where 
no mobile group I introns are known, as is the case with the 
plastids of higher plants, demonstrate that an applicability to 



0817/00028 



CA 02470329 2004-06-14 



7 

heterologous systems is not readily feasible. It is therefore by 
no means obvious for the skilled worker to apply the observations 
on the alga Chlamydomonas to higher plants. In contrast, there 
are a number of suggestions which make such an applicability 
5 rather doubtful: 



1. Homing systems cannot be applied readily from one system to 
another (Eddy SR and Gold L (1992) Proc Natl Acad Sci USA 
89:1544-1547). The applicability to higher plants is all the 

10 more dubious since no homing endonucleases have been 

identified in those plastidic genomes of higher plants which 
have already been sequenced 

( http : / /megasun . bch . umontreal . ca/ogmp/pro j ects/other/cp_list . 
html). It can therefore be assumed that the introns found in 
15 the plastidic genome of higher plants are not mobile, and 

that no homing mechanism exists naturally in these genomes. 

2. Chlamydomonas only has one plastid per cell, while in cells 
of higher plants up to 100 plastids are present per cell. 

20 

3. The efficiency of conventional plastid transformation in 
Chlamydomonas exceeds that in higher plants by several orders 
of magnitude, which suggests that these two systems cannot be 
compared directly with one another. As regards the 

25 regeneration of transplastomic algae or transplastomic 

plants, the fact that division of the algal plastids is 
synchronized with the cell cycle, while this is not the case 
for the plastids of the higher plants, might also play an 
important role (Sato N (2001) Trends Plant Science 

30 6:151-155). 

4. The mechanisms of DNA integration into plastids of 
Chlamydomonas and of higher plants appear to be fundamentally 
different. Thus, it has been found that inter-specific 
plastid transformation (where homologous regions are utilized 
instead of identical sequences) in Chlamydomonas leads to a 
marked reduction of the transformation efficiency, which was, 
however, not observed in tobacco. This also applies 
analogously to the distance of a molecular marker on the 
homologous DNA from the heterologous sequence on the 
transformation plasmid: the closer the molecular marker to 
the edge of the target region for integration by means of 
double crossover, the less frequently it is transferred when 
transformed into Chlamydomonas plastids. In tobacco, multiple 
recombination mechanisms were observed, but here even 
molecular markers which were close to the edge of the 
homologous regions were transferred efficiently into the 
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plastidic genome during transformation (Kavanagh TA et al. 
(1999) Genetics 152: 1111-1122 and references cited therein). 

5. In Chlamydomonas , the plastids of the two parents fuse during 
5 hybridization, even in the case of inter-specific 

hybridization. In Chlamydomonas , plastid fusion is a natural 
process, and the DNA of the plastids too is mixed and 
undergoes new recombination. This is why mobile introns in 
the organelles of these organisms make sense. In contrast, in 

10 most of the higher plants, the plastids are inherited 

uniparentally, so that neither mixing of the plastidic DNA 
results nor recombinations can occur between the maternal and 
the paternal plastidic DNA. Even in those plant species in 
which the plastids are inherited biparentally, no plastid 

15 fusion was observed. It can therefore be assumed that natural 

plastid fusion in higher plants can be ruled out (Hagemann R 
(1992) plastidic genetics in higher plants; in Cell 
Organelles, editors Herrmann RG, Springer Verlag, Vienna, 
pp. 65-96) and that mechanisms like intron homing are either 

20 not developed or even suppressed. 

Increasing the homologous recombination efficiency within the 
nuclear DNA with the aid of rare endonucleases has been described 
for various organisms (Puchta H et al. (1993) Nucleic Acids 

25 Research. 21(22 ): 5034-40; Puchta H et al. (1996) Proc Natl Acad 
Sci USA 93:5055-5060; Rong YS and Golic KG (2000) Science 
28:2013-2018; Jasin M (1996) Trends Genet 12: 224-229). In 
contrast to plastids , insertion by homologous recombination into 
the nuclear DNA is problematic and usually takes place owing to 

30 random illegitimate integration. This demonstrates that 

techniques which are established for the nuclear genome cannot 
necessarily be applied to the plastids. In contrast to the 
situation regarding the nucleus, integration in plastids of 
higher plants takes place virtually exclusively, and with high 

35 efficiency, via homologous recombination (Bock R and Hagemann R 
(2000) Progress in Botany 61:76-90; Maliga P et al. (1994) 
Homologous recombination and integration of foreign DNA in 
plastids of higher plants. In Homologous recombination and gene 
silencing in plants. Paszkowski J, ed. (Kluwer Academic 

40 publishers), pp. 83-93). 

The homologous recombination efficiency for the integration of 
DNA into the plastome has generally not been thought of as a 
limiting factor and, in contrast, considered as not being 
45 critical. Accordingly, current research into the optimization of 
plastid transformation does not focus on the optimization of 
homologous recombination but for example on improved selection 



0817/00028 



CA 02470329 2004-06-14 



9 

markers , improved selection and regeneration techniques and the 
like. Nevertheless, no essential breakthrough has been achieved 
to date. 

5 as emphasized clearly by the above-described methods and problems 
in the transformation of plastids, providing novel methods for 
the generation of homotransplastomic plants is a long-existing, 
unmet need of plant biotechnology. A further need is the 
avoidance of antibiotic or herbicide selection markers for 
10 reasons of registration and consumer acceptance. To date, no 

plastid transformation method has been described which does away 
with the need for such a selection marker. 

It is therefore an object to develop novel methods which ensure 
15 efficient integration of foreign DNA in all copies of the 

plastidic DNA and which make possible the efficient selection of 
corresponding homotransplastomic plants. Surprisingly, this 
object has been achieved by providing the integration/selection 
method according to the invention. 

20 

A first subject matter of the invention relates to a method for 
the integration of a DNA sequence into the plastidic DNA of a 
multi-celled plant or cell derived therefrom and for the 
selection of predominantly homotransplastomic cells or plants, 
25 wherein 

a) the plastidic DNA molecules of said multi-celled plant or 
cell derived therefrom comprise at least one recognition 
sequence for the directed induction of DNA double-strand 

30 breaks and 

b) at least one enzyme suitable for the induction of DNA 
double-strand breaks at the recognition sequence for the 
directed induction of DNA double-strand breaks and at least 

35 one transformation construct comprising an insertion sequence 

are combined in at least one plastid of said multi-celled 
plant or cell derived therefrom, and 

c) DNA double-strand breaks are induced at the recognition 
40 sequences for the directed induction of DNA double-strand 

breaks, and 

d) the insertion sequence inserts into the plastidic DNA, the 
functionality of the recognition sequence for the directed 

45 induction of DNA double-strand breaks being deactivated so 

that said recognition sequence is no longer capable of being 
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cleaved by the enzyme suitable for the induction of DNA 
double-strand breaks, and 

e) plants or cells in which the insertion sequence has been 
5 inserted into the plastidic DNA molecules are isolated. 

Surprisingly, the system makes possible a substantial increase of 
the efficiency in the generation of predominantly 
homotransplastomic plants. In this context, not only the efficacy 
10 of insertion into the plastidic DNA, but also the efficacy of the 
selection process of predominantly homotransplastomic plants are 
increased. 

Application of the method according to the invention results in a 

15 selection pressure of incorporating the insertion sequence into 
all of the copies of the plastidic DNA. Ideally, the insertion 
sequence is spread independently of the selection markers, such 
as herbicide or antibiotic resistances. This has pronounced 
advantages with regard to registration and/or consumer 

20 acceptance. However, the use of such selection markers can 
further increase the efficiency. The method according to the 
invention for the generation of homoplastomic plants clearly 
outperforms the prior-art methods since it shows a more rapid, 
more efficient and therefore more economical route to obtain 

25 homotransplastomic plants. A further advantage of the system is 
that the size of the constructs employed for the transformation 
can be kept small since the homologous regions in the plastid 
transformation vector can be smaller in comparison with the 
integration by means of double crossover, or can be completely 

30 absent. 

The transformation of plastids has a large number of advantages 
over the transformation of the nucleus. The following are to be 
mentioned inter alia: 

35 

a) While homologous recombination into the nuclear DNA can only 
be realized with difficulty, DNA in plastids can be 
integrated readily at a predefined locus by means of double 
crossover, a form of homologous recombination. Positional 
40 effects or gene silencing, which are encountered in 

transformations of the nucleus owing to the illegitimate 
integration at a non-predefined locus, are thus avoided. 



b) 

45 



Very high expression levels can be achieved, presumably owing 
to the high copy number of the plastidic DNA. 
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c) In higher plants, plastidic DNA is, as a rule, only subject 
to maternal inheritance so that the foreign DNA introduced 
cannot be spread via pollen and cross-pollination can thus be 
prevented effectively. 

5 

d) The prokaryotic nature of the plastids makes possible the 
expression of genes in the context of a polycistronic operon 
structure. It is therefore not necessary to equate each gene 
to be expressed with its own promoter and the like. This 

10 facilitates the introduction of a large number of genes in 
one pass, for example for introducing entire biosynthetic 
pathways into the plastids. 

"Plastid" refers to the proplastids and to all organelles to 
15 which they give rise, such as, for example, chloroplasts, 

etioplasts, chromoplasts, amyloplasts , leukoplasts, dermaplasts 
and elaioplasts (Heifetz P (2000) Biochimie 82:655-666). 

"Plastome" refers to the genome, i.e. the totality of the genetic 
20 information , of a plastid. 

"Homotransplastomic" refers to a transplastomic and homoplastomic 
state. 

25 With regard to, for example, a plant, cell, tissue, plastid or a 
plastidic DNA, "transplastomic" refers to all those forms of the 
above, realized by recombinant methods, which comprise a 
plastidic DNA which has been modified by recombinant methods, it 
being possible for the modification to comprise, for example, 

30 substitutions, additions, deletions, inversions or insertions of 
one or more nucleotide residues. 

"Heteroplastomic" refers to the presence of a mixed population of 
a variety of plastidic DNAs within a single plastid or within a 
35 population of plastids within a plant cell or tissue. 

"Homoplastomic" refers to a uniform population of plastidic DNA 
within a single plastid or within a population of plastids within 
a plant cell or tissue. Homoplastomic cells, tissues or plants 
40 are genetically stable since they only comprise one type of 
plastidic DNA, i.e. they generally remain homopolastomic even 
when the selection pressure ceases. Progeny obtained by selfing 
are likewise homoplastomic • 

45 For the purposes of the present invention, "predominantly 

homoplastomic" or "predominantly homotransplastomic" refers to 
all those plants or cells in which the percentage of the desired 
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plastidic DNA molecules which have been modified with regard to a 
trait — for example with the recognition sequence for the 
directed induction of DNA double-strand breaks or the inserted 
insertion sequence — amounts to at least 50%, preferably at least 
5 70% , very especially preferably at least 90%, most preferably at 
least 95% of the totality of all plastidic DNA molecules in a 
plant or a tissue, cell or plastid of same. Predominantly 
homoplastomic or predominantly homotransplastomic plants <:an be 
converted into homoplastomic or homotransplastomic plants by 

10 continued maintenance of the selection pressure and, if 

appropriate, repeated regeneration steps. Owing to the homing 
process, however, a continuous selection pressure is not 
necessarily required. In a particular embodiment, a predominantly 
homoplastomic, or homotransplastomic, plant is therefore truly 

15 homoplastomic, or homotransplastomic. A plant which, with regard 
to a DSB recognition sequence, is predominantly homoplastomic or 
homotransplastomic, or truly homoplastomic or homotransplastomic, 
is subsequently referred to as "master plant". The percentage of 
the desired plastidic DNA molecules which have been modified with 

20 regard to a trait can be determined in the manner known to the 
skilled worker, for example by means of Southern analysis as 
described by way of example in Example 4 . The ratio between the 
plastid starting DNA molecules and the plastidic DNA molecules 
which have been modified with regard to a trait can be determined 

25 by comparing the intensity of the bands in question. 

"Multi-celled plant or cell derived therefrom" refers generally 
to all those cells, tissues, parts or propagation materials (such 
as seeds or fruits) of a plant which constitutes, or may 

30 constitute, a multi-celled organism in its adult state. Included 
for the purpose of the invention are all genera and species of 
higher and lower plants of the plant kingdom. Annual, perennial, 
monocotyledonous and dicotyledonous plants are preferred. 
Included are mature plants, seeds, shoots and seedlings, and 

35 parts derived therefrom, propagation material (for example 

tubers, seeds or fruits) and cultures, for example cell or callus 
cultures. "Mature plants" means plants at any developmental stage 
beyond the seedling stage. The term seedling means a young 
immature plant an an early developmental stage. 

40 

Preferred plants are those from the following plant families: 
Amaranthaceae , Aster aceae, Brassicaceae, Carophyllaceae, 
Chenopodiaceae , Compositae, Cruciferae, Cucurbitaceae, Labiatae, 
Leguminosae, Papilionoideae, Liliaceae, Linaceae, Malvaceae, 
45 Kosaceae, Rubiaceae, Saxifragaceae, Scrophulariaceae, Solanaceae, 
Sterculiaceae, Tetragoniaceae, Theaceae, Umbelliferae. 
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Preferred monocotyledonous plants are selected in particular from 
the monocotyledonous crop plants such as, for example, the 
Gramineae family such as rice, maize, wheat or other cereal 
species such as barley, millet and sorghum, rye, triticale or 
5 oats, and sugar cane, and all grass species. 

Preferred dicotyledonous plants are selected in particular from 
dicotyledenous crop plants, such as, for example, 

10 - Asteraceae such as sunflower, tagetes or calendula and others, 

- Compositae, especially the genus Lactuca, very particularly the 
species sativa (lettuce) and others, 

15 - Cruciferae, particularly the genus Brassica, very particularly 
the species napus (oilseed rape), campestris (beet) r oleracea 
cv Tastie (cabbage), oleracea cv Snowball Y (cauliflower) and 
oleracea cv Emperor (broccoli) and other cabbages; and the 
genus Arabidopsis, very particularly the species thaliana, and 

20 cress or canola and others, 

- Cucurbitaceae such as melon, pumpkin/ squash or zucchini and 
others , 

25 - Leguminosae, particularly the genus Glycine, very particularly 
the species max (soybean), soybean, and alfalfa, pea, bean or 
peanut and others, 

- Rubiaceae, preferably the subclass Lamiidae such as, for 

30 example, Coffea arabica or Coffea liberica (coffee bush) and 
others , 

- Solanaceae, particularly the genus Lycopersicon, very 
particularly the species esculentum (tomato), the genus 

35 Solanum, very particularly the species tuberosum (potato) and 
melongena (aubergine) and tobacco or paprika and others , 

- Sterculiaceae, preferably the subclass Dilleniidae such as, for 
example, Theobroma cacao (cacao bush) and others, 

40 

- Theaceae, preferably the subclass Dilleniidae such as, for 
example, Camellia sinensis or Thea sinensis (tea shrub) and 
others, 

45 - Umbelliferae, particularly the genus Caucus (very particularly 
the species carota (carrot)) and Apium (very particularly the 
species graveolens dulce (celery)) and others? and the genus 
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Capsicum, very particularly the genus annuum (pepper) and 
others , 

and linseed, soybean, cotton, hemp, flax, cucumber, spinach, 
5 carrot, sugar beet and the various tree, nut and grapevine 
species, in particular banana and kiwi fruit. 

Also encompassed are ornamental plants, useful or ornamental 
trees, flowers, cut flowers, shrubs or turf. Plants which may be 

10 mentioned by way of example but not by limitation are 
angiosperms, bryophytes such as, for example, Hepaticae 
(liverworts) and Musci (moBses); pteridophytes such as ferns, 
horsetails and clubmosses; gyranosperms such as conifers, cycads, 
ginkgo and Gnetatae, the families of the Rosaceae such as rose, 

15 Ericaceae such as rhododendron and azalea, Euphorbiaceae such as 
poinsettias and croton, Caryophyllaceae such as pinks, Solanaceae 
such as petunias, Gesneriaceae such as African violet, 
Balsaminaceae such as touch-me-not, Orchidaceae such as orchids, 
Iridaceae such as gladioli, iris, freesia and crocus, Compositae 

20 such as marigold, Geraniaceae such as geranium, Liliaceae such as 
dracena, Moraceae such as ficus, Araceae such as philodendron and 
many others. 

Most preferred are Arabidopsis thaliana, Nicotiana tabacum, 
25 Tagetes and Brassica napus and all those genera and species which 
are used as foods or feeds, such as the above-described cereal 
species, or which are suitable for the production of oils, such 
as oil plants, nut species, soybean, sunflower, pumpkin/squash 
and peanut. 

30 

"Enzyme suitable for inducing DNA double-strand breaks at the 
recognition sequence for the directed induction of DNA 
double-strand breaks" (hereinbelow referred to as "DSBI enzyme" 
for "double strand-break inducing enzyme") generally refers to 
35 all those enzymes which are capable of generating, in a 

sequence-specific manner, double-strand breaks in double-stranded 
DNA. The following may be mentioned by way of example, but not by 
limitation: 

40 1. Restriction endonucleases, preferably type II restriction 

endonuc leases, especially preferably homing endonucleases as 
described in detail hereinbelow. 



45 
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2. Artificial nucleases such as described in detail hereinbelow, 
such as, for example, chimeric nucleases, mutated restriction 
or homing endonuc leases, or RNA protein particles derived 
from mobile group II introns. 

5 

Both natural and artificially generated DSBI enzymes are 
suitable. Preferred are all those DSBI enzymes whose recognition 
sequence is known and which can be obtained either in the form of 
their proteins (for example by purification) or which can be 
10 expressed using their nucleic acid sequence. 

The DSBI enzyme, whose specific recognition sequence is known, is 
preferably selected in such a way that it has no further 
functional recognition regions in the plastidic genome, in 
15 addition to the target recognition sequence. Homing endonucleases 
are therefore very especially preferred (review: Belfort M and 
Roberts RJ (1997) Nucleic Acids Res 25:3379-3388; Jasin M (1996) 
Trends Genet 12:224-228; website: 

http://rebase.neb.com/rebase/rebase.homing.html; Roberts RJ and 
20 Macelis D (2001) Nucleic Acids Res 29: 268-269). These meet this 
requirement owing to their long recognition sequences. Owing to 
the small size of the plastome, however, it is also feasible that 
DSBI enzymes with shorter recognition sequences (for example 
restriction endonucleases) can be employed successfully. 

25 

In addition to the above-described preferred embodiment, where 
only a singular recognition sequence for the DSBI enzyme is 
present in the plastidic DNA, cases where further, functionally 
identical, recognition sequences can be employed advantageously 
30 are also feasible. This is the case in particular when the 

plastome comprises duplicated genes (for example in the form of 
inverted repeats). Here, integration into all copies is to take 
place, so that cleavage in all copies is likewise desirable. 

35 The sequences which encode such homing endonucleases can be 

isolated for example from the chloroplast genome of Chlamydomonas 
(Tunnel M et al. (1993) J Mol Biol 232: 446-467). They are small 
(18 to 26 kD), but have, in their open reading frame (ORF), a 
"coding usage" which is directly suitable for expression in the 

40 nucleus or plastids of higher plants (Monnat RJ Jr et al. (1999) 
Biochem Biophys Res Com 255:88-93). 

Further homing endonucleases are mentioned in the abovementioned 
website; homing endonucleases which may be mentioned are, for 
45 example, F-Scel, F-Scell, F-Suvl, F-TevI, F-TevII, I-Amal f 
I-Anil, I-Ceul, I-CeuAIIP, I-Chul, I-Cmoel, I-Cpal, I-CpaII, 
I-Crel, I-CrepsbIP, 1-CrepsbIIP, I-CrepsbIIIP, I-CrepsbIVP, 
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I-CsmI, I-Cvul, I-CvuAIP, I-DdiII, I-Dirl, I-Dmol, I-HspNIP, 
I-Llal, I-Msol, I-Naal, I-NanI, 1-NclIP, I-NgrIP, I-NitI, I-Njal, 
I-Nsp236IP, I-PakI, I-PboIP, I-PcuIP, I-PcuAI, I-PcuVI, I-PgrlP, 
I-PobIP, I-Porl, I-PorIIP, I-PpbIP, I-Ppol, I-SPBetaIP, I-Scal, 
5 I-Scel f I-Scell, I-SceIII , I-SceIV, I-SceV, I-SceVI, I-SceVII, 
I-SexIP, I-SneIP, I-SpomCP, I-SpomIP, I-SpomIIP, I-SquIP, 
I-Ssp6803I, I-SthPhiJP, I-SthPhiST3P, I-SthPhiS3bP, I-TdeIP, 
I-TevI, I-TevII, I-TevIII, I-UarAP, I-UarHGPAlP, I-UarHGPA13P, 
I-VinIP, I-ZbiIP, PI-MtuI, PI-MtuHIP, PI-MtuHIIP, Pl-Pful, 
10 Pl-PfuII, Pl-Pkol, Pl-PkoII, PI-PspI, PI-Rma43812lP, PI-SPBetalP, 
Pl-Scel, PI-Tful, PI-TfuII, PI-Thyl, PI-Tlil, PI-Tlill. 

Preferred in this context are those homing endonucleases whose 
gene sequences are already known, such as f for example, F-Scel, 
15 I-Ceul, I-Chul, I-Dmol, I-Cpal, I-CpaII, I-Crel, I-CsmI, F-TevI, 
F~TevII, i-TevI, I-TevII, I -Anil, I-Cvul, I-Llal, I-NanI, I-Msol, 
I-NitI, I-Njal, I-PakI, I-Porl, I-Ppol, I-Scal, I-Scel, 
I-Ssp6803l, Pl-Pkol, Pl-PkoII, PI-PspI, PI-Tful, PI-Tlil. 

20 Homing endonucleases which are especially preferably used are 
those which are found as such naturally, especially preferably 
those which are found naturally in organelles. Most preferably, 
the homing endonucleases originate from organisms which live at 
similar temperatures to plants. Those which are of particular 

25 interest in this contest are the homing endonucleases identified 
in yeast and Chlamydomonas species. Naturally, it is also 
feasible to utilize homing endonucleases which are isolated from 
extremophilic organisms, as long as they are active in the 
plastids of the plant to be transformed. 

30 

The following are very especially preferred: 

I-Ceul (Cote MJ and Turmel M (1995) Curr Genet 27:177-183.; 
Gauthier A et al. (1991) Curr Genet 19:43-47; Marshall (1991) 
35 Gene 104:241-245; GenBank Acc. No.: Z17234 nucleotides 5102 

to 5758), 

I-Chul (Cote V et al. (1993) Gene 129:69-76; GenBank Acc. 
No.: L06107, nucleotides 419 to 1075), 

40 

I-Cmoel (Drouin M et al. (2000) Nucl Acids Res 28: 
4566-4572) , 



45 
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I-Cpal from Chlamydomonas pallidostigmatica (GenBank Acc. 
No.: L36830, nucleotides 357 to 815; Turmel M et al. (1995) 
Nucleic Acids Res 23:2519-2525; Tunnel, M et al, (1995) 
Mol Biol Evol 12:533-545; see also SEQ ID NO: 13 and 14) 

5 

I-CpaII (Turmel M et al. (1995) Mol Biol Evol 12:533-545? 
GenBank Acc. No.: L39865, nucleotides 719 to 1423), 

I-Crel (Wang J et al. (1997) Nucleic Acids Res 25: 3767-377 6; 
10 Durrenberger, F and Rochaix JD (1991) EMBO J 10:3495-3501; 

GenBank Acc. No.: X01977, nucleotides 571 to 1062), 

I-Csml (Ma DP et al. (1992) Plant Mol Biol 18:1001-1004) 

15 - . I-NanI (Elde M et al. (1999) Eur J Biocbem. 259:281-288; 
GenBank Acc. No.: X7828Q, nucleotides 418 to 1155), 

I-NitI (GenBank Acc. No.: X78277, nucleotides 426 to 1163), 

20 - I-Njal (GenBank Acc. No.: X78279, nucleotides 416 to 1153), 

I-Ppol encodes on the extrachromosomal DNA in the nucleus of 
Physartim polycephalum (Muscarella DE and Vogt VM (1989) 
Cell 56:443-454; Lin J and Vogt VM (1998) Mol Cell Biol 

25 18:5809-5817; GenBank Acc. No.: M38131, nucleotides 66 to 

577). In addition, the longer sequence encoding I-Ppol, which 
originates from an alternative start codon, may also be used. 
This sequence comprises the nucleotides 20 to 577 in the 
sequence of GenBank Acc. No. M38131. It is preferred to use 

30 the shorter sequence; however, it can be substituted at any 

location by the corresponding, longer one. A sequence which 
is especially preferred for the purposes of the present 
invention is an artificial sequence which encodes for the 
same amino acid as the sequence of the nucleotides 86 to 577 

35 of the sequence of GenBank Acc. No.: M38131 (see also SEQ ID 

NO: 5, 11, 12, 70 or 71), 

I-Pspl (GenBank Acc. No.: U00707, nucleotides 1839 to 3449), 

40 - I-Scal (Monteilhet C et al. (2000) Nucleic Acids Res 28: 

1245-1251; GenBank Acc. No.: X95974, nucleotides 55 to 465) 

I-Scel from the mitochondria of bakers' yeast (WO 96/14408; 
US 5,962,327 Seq ID NO: 1 therein), 

45 
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Endo Seel (Kawasaki et al. (1991) J Biol Chem 266:5342-5347, 
identical with F-Scel; GenBank Acc. No.: M63839, nucleotides 
159 to 1589), 

5 - I-SceII (Sarguiel B et al. (1990) Nucleic Acids Res 
18:5659-5665), 

I-SceIII (Sarguiel B et al. (1991) Mol Gen Genet. 
255:340-341) , 

10 

I-Ssp6803I (GenBank Acc. No.: D64003, nucleotides 35372 to 
35824), 

I-Tevl (Chu et al. (1990) Proc Natl Acad Sci USA 
15 87:3574-3578; Bell-Pedersenet al. (1990) Nucleic Acids Res 

18:3763-3770; GenBank Acc. No.: AF158101, nucleotides 144431 
to 143694), 

I-TevII (Bell-Pedersen et al. (1990) Nucleic Acids Res 
20 18:3763-3770; GenBank Acc. No.: AF158101, nucleotides 45612 

to 44836), 

I-TevIII (Eddy et al. (1991) Genes Dev. 5:1032-1041) , 

25 Commercially available homing endonucleases such as I-Ceul, 

I-Scel, I-Ppol, PI-PspI or Pl-Scel are very especially preferred. 
Most preferred are I-Scel and I-Ppol. While the gene encoding 
I-Ppol can be used in its natural form, the gene encoding I-Scel 
contains an editing locus. Since, in contrast to yeast 

30 mitochondria, the editing in question is not performed in the 
plastids of higher plants, an artificial sequence which encodes 
the I-Scel protein must be employed for the heterologous 
expression of this enzyme (US 5,866,361). 

35 In addition to the above-stated homing endonucleases, there are 
further intron-encoded enzymes which can be found at homologous 
locations of the genomes of related organisms. As a rule, these 
homing endonucleases have similar sequence specificity and are 
therefore equally suited as DSBI enzyme for the introduction of a 

40 DSB into the plastome at the DSB recognition sequence. The 
corresponding sequence is thus also recognized by Sob2593c, 
Clu2593c, Col2593c, Ciy2593c, Hla2593c, Cag2593c, I-CvuX, I-PakI, 
Tmu2593c, Msp2593c, I-Msol, Sdu2593c, Mvi2593m, Nol2593m or 
Aca2593m, in addition to I-Crel. Corresponding sequence is also 

45 recognized by I-Cecl, I-Cmol, I-Cell, I-CpaIII, I-Cmul, I-Clui, 
I-SobI or I-Astl, in addition to I-Ceul. A corresponding sequence 
is also recognized by Cbrl931c, Cfrl931c, Cmel931c, Cgel931c, 
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Pcrl931c, Mspl931c, Msol931c, Ptul931c, Cvul931m f Mspl931m, 
Msol931m, Noll931m, Acal931m or Snel931b, in addition to I-Cpal. 
Moreover, introns exist which are inserted at position 1951 of 
the 23S rDNA (numbering refers to homologous position in the 23S 
5 rDNA of E. coli). These introns / again, encode putative homing 
endonucleases which can be used as DSBI enzymes for the specific 
cleavage of the plastidic DNA. They include, for example, 
Cbrl951c, Mspl951c r Msol951c, Cvul951m or Acal951ra (Lucas P et 
al. (2001) Nucl Acids Res 29:960-969). 

10 

Most preferred are the homing endonucleases of the protein 
sequences described by SEQ ID NO: 5, 12 or 14. When preparing 
corresponding expression cassettes, accordingly, nucleic acid 
sequences are employed which encode a protein as shown in SEQ ID 
15 NO: 5, 12 and 14, respectively; especially preferred in this 

context is the use of the nucleic acid sequences as shown in SEQ 
ID NO: 11 and 13 or of an expression cassette as shown in SEQ ID 
NO: 4. 

20 The enzymes can be isolated from their source organisms in the 
manner with which the skilled worker is familiar and/or the 
nucleic acid sequence encoding them can be cloned. The sequences 
of a variety of enzymes have been deposited at GenBank (see 
above) . 

25 

Examples of artificial DSBI enzymes by way of example are 
chimeric nucleases which are composed of an unspecific nuclease 
domain and a sequence-specific DNA binding domain (for example 
consisting of zinc fingers) (Smith J et al* (2000) Nucl Acids 

30 Res 28(17) :3361-3369; Bibikova M et al. (2001) Mol Cell Biol. 
21:289-297). Thus, for example, the catalytic domain of the 
restriction endonuclease Fokl has been fused with zinc finger 
binding domains, whereby the specificity of the endonuclease has 
been defined (Chandrasegaran S & Smith J (1999) Biol Chem 

35 380:841-848; Kim YG & Chandrasegaran S (1994) Proc Natl Acad Sci 
USA 91:883-887; Kim YG et al. (1996) Proc Natl Acad Sci USA 
93:1156-1160). The catalytic domain of the yeast Ho endonuclease f 
too, has already been conferred a predefined specificity, using 
the above-described technique, by fusing it with the zinc finger 

40 domain of transcription factors (Nahon E & Raveh D (1998) Nucl 
Acids Res 26:1233-1239). 

As mentioned, zinc finger proteins are particularly suitable as 
DNA binding domain for the purpose of chimeric nucleases. These 
45 DNA-binding zinc finger domains can be adapted to match any 
desired DNA sequence. Suitable methods for the preparation of 
such zinc finger domains are described and known to the skilled 
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worker (Beerli RR et al. (2000) Proc Natl Acad Sci 
USA 97(4) :1495-1500; Beerli RR et al. (2000) J Biol Chem 
275(42) :32617-32627; Segal DJ and Barbas CF 3rd. (2000) Curr 
Opin Chem Biol 4(l):34-39; Kang JS and Kim JS (2000) J Biol Chem 
5 275(12) :8742-8748; Beerli RR et al. (1998) Proc Natl Acad Sci USA 
95(25) : 14628-14633; Kim JS et al. (1997) Proc Natl Acad Sci USA 
94(8) :3616-3620; Klug A (1999) J Mol Biol 293(2) :215-218; Tsai SY 
et al. (1998) Adv Drug Deliv Rev 30(1-3) : 23-31; Mapp AK et al. 
(2000) Proc Natl Acad Sci USA 97(8) : 3930-3935; Sharrocks AD 

10 et al. (1997) Int J Biochem Cell Biol 29(12) : 1371-1387; Zhang L 
et al. (2000) J Biol Chem 275(43) : 33850-33860 ) . Methods for the 
preparation and selection of zinc finger DNA binding domains with 
high sequence specificity have been described (WO 96/06166, 
WO 98/53059, WO 98/53057). Fusing a DNA binding domain thus 

15 obtained with the catalytic domain of an endonuclease (such as, 
for example, the Fokl or Ho endonuclease) allows the preparation 
of chimeric nucleases with any desired specificity which can be 
employed advantageously as DSBI enzymes for the purposes of the 
present invention. 

20 

Artificial DSBI enzymes with modified sequence specificity can 
also be prepared by mutating known restriction endonucleases or 
homing endonucleases by methods known to the skilled worker. The 
mutagenesis of maturases with the purpose of obtaining a modified 

25 substrate specificity is of particular interest, besides the 

mutagenesis of homing endonucleases. Frequently, maturases share 
many features with homing endonucleases and, if appropriate, they 
can be converted into nucleases by carrying out few mutations. 
This has been shown, for example, for the maturase in the bakers' 

30 yeast bi2 intron. Only two mutations in the maturase-encoding 

open reading frame (0RF) sufficed to confer a homing endonuclease 
activity to this enzyme (Szczepanek & Lazowska (1996) EMBO J 
15:3758-3767) . 

35 Further artificial nucleases can be generated with the aid of 
mobile group II introns and the proteins encoded by them, or 
parts of these proteins. Many mobile group II introns, together 
with the proteins encoded by them, form RNA-protein particles 
which are capable of recognizing, and cleaving, DNA in a 

40 sequence- specific manner. Here, the sequence specificity can be 
adapted to suit the needs by mutating certain intron regions (see 
hereinbelow) (WO 97/10362) . 



45 
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The skilled worker is familiar with various methods for 
introducing a DSBI enzyme into plastids or expressing it therein. 
The following may be mentioned by way of example, but not by 
limitation: 

5 

a) Nuclear expression using plastidic transit peptides 

An expression cassette encoding a DSBI enzyme fusion protein 
can be constructed in the manner known to the skilled worker , 

10 introduced into the nucleus and - optionally — integrated 

stably into the chromosomal DNA. For transport into the 
plastids , the DSBI enzyme is preferably expressed in fusion 
with a plastid localization sequence (PLS). Methods for the 
direct transportation, into the plastids / of proteins which 

15 per se are not localized in the plastids , and a variety of 

PLS sequences, have been described (Klosgen RB and Weil JH 
(1991) Mol Gen Genet 225(2) : 297-304? Van Breusegem F et al. 
(1998) Plant Mol Biol 38(3) :491-496) . Preferred are those PLS 
which, after translocation of the DSBI enzyme into the 

20 plastids, are cleaved enzymatically from the DSBI enzyme 

moiety. Especially preferred is the PLS which is derived from 
the plastidic Nicotiana tabacum transketolase or from another 
transit peptide (for example the transit peptide of the small 
Rubisco subunit or of the ferredoxin NADP oxidoreductase, and 

25 also isopentenyl-pyrophosphate isomerase-2) or its functional 

equivalent. Promoters which are suitable for expression in 
the nucleus are, in principle, all those which make possible 
an expression in plants. Examples can be found further below. 
Preferred are constitutive promoters such as the CaMV 35S 

30 promoter or the nitrilase-1 promoter of the Arabidopsis nitl 

gene (GenBank Acc. No.; Y07648.2, nucleotides 2456 to 4340; 
Hillebrand H et al. (1998) Plant Mol Biol 36 (l):89-99; 
Hillebrand fl et al. (1996) Gene 170(2)5 197-200). 

35 Preferred PLS sequences are: 

i) the Arabidopsis isopentenyl isomerase (IPP) transit 
peptide (GenBank Acc. No.: NC_003074; nucleotides 604657 
- 604486) 

40 

ii) transit peptides derived from the small subunit (SSU) of 
ribulose-bisphosphate carboxylase (Rubisco ssu) from, for 
example, pea, maize, sunflower or Arabidopsis. 



45 



Arabidopsis thaliana: GenBank Acc. No,: for example 
AY054581, AY054552; 
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pea, GenBank Acc. No.: for example X00806, 
nucleotides 1086 to 1256; X04334, X04333 (Hand JM 
(1989) EMBO J 8(ll):3195-206). Especially preferred 
in this context are: expression cassette and transit 
5 peptide (pea, rbcS3A) from vector pJIT117 (Guerineau 

F (1988) Nucleic Acids Res 16(23 ): 11380. Especially 
preferred is the peptide sequence as shown in SEQ ID 
NO: 35. Most preferred for the use in constructing 
suitable expression constructs is the nucleic acid 
10 sequence as shown in SEQ ID NO: 34. 

maize, GenBank Acc. No.: for example 842568, S42508 

sunflower, GenBank Acc. No.: Y00431, nucleotides 301 
15 to 465. 

iii) transit peptides derived from plant fat biosynthesis 
genes, such as the transit peptide of the plastid acyl 
carrier protein (ACP) (for example the Arabidopsis 
20 thaliana beta-ketoacyl-ACP synthetase 2; GenBank Acc. 

No.: AF318307), stearyl-ACP desaturase, (1-ketoacyl-ACP 
synthase or acyl-ACP thioesterase (for example A. thaliana 
mRNA for acyl- (acyl carrier protein) thioesterase: GenBank 
Acc. No.: Z36911) . 



25 



30 



iv) the GBSSI (starch granule bound synthase I) transit 
peptide 

v) the transit peptide of the LHCP II genes. 



Especially preferred is the PLS of the plastidic tobacco 
transketolase (SEQ ID NO: 36). To express corresponding 
fusion proteins, different PLS nucleic acid cassettes can be 
used in the three reading frames as KpnI/BamHI fragments for 
35 the purposes of the present invention (the translation start 

(ATG codon) is localized in the Ncol cleavage site) (pTP09 
SEQ ID NO: 37; pTPIO SEQ ID NO: 38; pTPll SEQ ID NO: 39). 

A further example of a PLS to be employed advantageously is 
40 the transit peptide of the plastidic Arabidopsis thaliana 
isopentenyl-pyrophosphate isomerase-2 (IPP-2) (SEQ ID NO: 
40). The nucleic acid sequences encoding three cassettes 
(corresponding to the three reading frames) of the PLS from 
the Arabidopsis thaliana isopentenyl-pyrophosphate 
45 isomerase-2 (IPP-2) can be used very especially preferably 
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(EcoRV/Sall cassettes with ATG; IPP-9 SEQ ID NO: 41; IPP-10 
SEQ ID NO: 42; IPP-11 SEQ ID NO: 43). 

The nucleic acids according to the invention can be of 
5 synthetic origin or have been obtained naturally or comprise 

a mixture of synthetic and natural nucleic acid components, 
or else consist of various heterologous gene segments from a 
variety of organisms. 

10 The sequence encoding the transit peptide can comprise all or 

part of the peptide sequence of the original protein. An 
accurate determination of the amino acid residues which are 
essential for the transport is not required as long as the 
functionality of the PLS - which is the transport into the 

15 plastid - is ensured and the function of the DSBI enzyme is 
not entirely destroyed. Very especially preferred are the 
following PLS sequences: 

PLS 1 : N-MASSSSLTLSQAILSRSVPRHGSASSSQLSPSSLTFSGLKSNPNITTSRRR 
20 TP S S AAAAAWRSPAIRAS AATET IEKTETAGS -C (SEQ ID NO: 36). 

Corresponds to the PLS of the tobacco plastidic 
transketolase . 

PLS2 : N-MSASSLFNLPLIRLRSLALSSSFSSFRFAHRPLSSISPRKLPNFRAFSGTA 
25 MTDTKDGSRVDM-C (SEQ ID NO: 40). Corresponds to the 

PLS of isopentenyl-pyrophosphate isomerase-2 ( IPP-2 ) , 
the last methionine preferably being the start 
methionine of the DSBI enzyme. 

30 The homing endonuclease as shown in SEQ ID NO: 69 is a 

preferred fusion protein of the native I-Ppo I nuclease and 
the IPP plastid localization sequence. This protein is 
preferably encoded by the sequence with the SEQ ID NO: 68. 

35 For the purposes of the present invention, fusion proteins of 
PLS and DSBI enzyme come under the term "DSBI enzyme". If a 
DSBI enzyme is expressed in the nucleus, the DSBI enzyme is 
preferably understood as meaning a fusion protein of PLS and 
DSBI enzyme. 

40 The invention furthermore relates to fusion proteins of DSBI 

enzymes with plastid localization sequence (PLS), sequences 
and expression cassettes comprising a fusion protein of a 
plastid localization sequence ( PLS) and a DSBI enzyme under 
the control of a promoter which is functional in the plant 

45 nucleus. Such suitable promoters are known to the skilled 

worker and described further below. The expression cassette 
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can comprise further elements such as, for example, 
transcription terminators and/or selection markers. 

b) Expression in plastids 

5 

An expression in plastids can also take place by the direct 
introduction of an expression cassette for the DSBI enzyme 
into plastids, if appropriate integration into the plastidic 
DNA, and expression of the DSBI enzyme. In a preferred 
10 embodiment, this expression cassette is present in the 

transformation construct which comprises the insertion 
sequence. 

Promoters which can be employed are, firstly, specific 
plastid or chromoplast promotors as detailed hereinbelow. 

15 However, a directed expression in plastids can also be 

achieved by using for example a viral, bacterial or 
bacteriophage promoter, introducing the resulting expression 
cassette into the plastidic DNA, and then inducing expression 
by the corresponding viral, bacterial or bacteriophage RNA 

20 polymerase. The corresponding RNA polymerase, in turn, can be 
introduced into the plastids in various ways, preferably by 
nuclear transformation in the form of a fusion protein with a 
PLS. Suitable methods have been described (WO 95/16783, 
WO 97/06250, US 5,925,806). It is preferably introduced into 

25 plastids by microinjection, especially preferably by means of 

particle bombardment. 

c) Introduction in the form of RNA 

30 The DSBI enzyme can also be introduced into plastids by 
introducing the mRNA — for example mRNA which has been 
generated in vitro — which encodes the DSBI enzyme via, for 
example, microinjection, particle bombardment (biolistic 
methods), or polyethylene glycol- or liposome-mediated 

35 transfection. This embodiment is advantageous since no 

DSBI -enzyme -encoding sequences remain in the plastome or 
genome in this case. Preferably, the RNA encoding the DSBI 
enzyme is generated by in-vitro transcription in the manner 
with which the skilled worker is familiar. 

40 

d) Introduction in the form of the protein 

The DSBI enzyme can be introduced into plastids directly, for 
example via microinjection, particle bombardment (biolistic 
45 methods) or polyethylene glycol transfection or 

liposome -mediated transfection. This embodiment is 
advantageous since no DSBI-en2yme-encoding sequences remain 
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in the plastome or genome. Such a method is described, for 
example, by Segal DJ et al. (1995) Proc Natl Acad Sci USA 
92.-806-810. 



The DSBI enzyme can be introduced into plant cells as a 
fusion protein with the VirE2 or VirP protein of an 
agrobacterium and a PLS. Such methods have been described for 
example for Cre recombinase (Vergunst AC et al. (2000) 
Science 290:979-982). This embodiment is advantageous since 
no DSBI enzyme-encoding sequences remain in the genome. 

Of course, combinations of the above -de scribed possibilities are 
also feasible. 



15 The expression cassette for the DSBI enzyme is preferably present 
on the insertion sequence or on the transformation construct. 

The DSBI enzyme is preferably introduced, or activated, 
simultaneously with, or after, the introduction of the insertion 
20 sequence into the plastids. Expression and/or activation at the 
correct site and the correct point in time can be ensured by 
various approaches: 



a) Inducible expression 

The expression of a DSBI enzyme can be controlled using an 
inducible promoter, preferably a chemically inducible 
promoter. To this end, for example, the expression cassette 
which encodes the DSBI enzyme can be transformed stably into 
the plastidic or nuclear DNA of a master plant. If it is 
transformed into the nuclear genome, the subcellular 
localization must be ensured — as described above — by 
suitable PLS transit peptides. Shortly before or during the 
transformation with the insertion sequence or the 
transformation construct, the expression of the DSBI enzyme 
will then be switched on by applying a suitable inductor, 
which depends on the choice of the inducible system. The 
skilled worker is familiar with a variety of methods or 
promoters for induced expression. Chemical compounds or else 
physical stimuli such as, for example, increased temperature 
or wounding and the like can act as stimulus. Various 
examples are described further below. 



b) 



Inducible activity 
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The DSBI enzyme can already exist in the plastids of the 
master plant when the activity is induced by suitable 
techniques at the selected point in time only, for example by 
addition of chemical compounds. Such methods have been 
5 described for sequence-specific recombinases (Angrand PO et 
al. (1998) Nucl Acids Res 26( 13) :3263-3269; Lqgie C and 
Stewart AF (1995) Proc Natl Acad Sci USA 92 ( 13) : 5940-5944; 
Imai T et al. (2001) Proc Natl Acad Sci USA 98( 1) :224-228) . 
Fusion proteins of the DSBI enzyme and the ligand binding 
10 domain of a steroid hormone receptor (for example the human 
androgen receptor, or mutated variants of the human estrogen 
receptor as described therein) are employed in these methods. 
Induction can be effected with ligands such as f for example, 
estradiol, dexamethasone, 4-hydroxytamoxifen or raloxifen. 

15 

Some of the DSBI enzymes are active in the form of the dimer 
(homo- or heterodimer) (I-Crel forms a homodimer; I-Ppol 
forms a homodimer, Flick KE et al. (1998) Nature 394: 
96-101). In general, enzymes of the LAGLIDADG family tend to 

20 form homodimers when only one LAGLIDADG motif is present per 
monomer (Jurica MS & Stoddard BL (1999) Cell Mol Life Sci 
55:1304-1326; I-Ceul may be mentioned by way of example). A 
dimerization can be designed to be inducible, for example by 
substituting the natural dimerization domains by the binding 

25 domain of a low-molecular -weight ligand. Addition of a 

dimeric ligand then brings about the dimerization of the 
fusion protein. Such inducible dimerization methods are 
described, as is the preparation of the dimeric ligands 
(Amara JF et al. (1997) Proc Natl Acad Sci USA 94(20): 

30 10618-10623; Muthuswamy SK et al. (1999) Mol Cell Biol 

19(10): 6845-6857; Schultz LW and Clardy J (1998) Bioorg Med 
Chem Lett 8(l):l-6; Keenan T et al. (1998) Bioorg Med Chem. 
6(8):1309-1335). 

35 c) Cotransfection 

The expression cassette encoding the DSBI enzyme is 
preferably introduced into the plastids simultaneously with 
the insertion sequence. In this context, the expression 

40 cassette for the DSBI enzyme and the insertion sequence may 
be present on one DNA molecule or else on two separate DNA 
molecules. Preferably, the two sequences are present together 
on one DNA molecule, so that the expression cassette is 
present in the transformation construct comprising the 

45 insertion sequence. 
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In an especially preferred embodiment, the sequence encoding 
the DSBI enzyme is removed from the genome of the transformed 
plasmids after homoplastomic plants have been regenerated. 
The skilled worker is familiar with a variety of methods for 
5 doing so which are detailed further below. 

Some of the above-described DSBI enzymes (in particular homing 
endonucleases) can have recognition sequences in the intermediate 
host E. coli, which is preferably used. Since, moreover, some 

10 expression cassettes for expression in plastids are also 

functional in S ♦ coli, it is preferred to prevent expression of 
the DSBI enzyme in E. coli in various ways with which the skilled 
worker is familiar in order to avoid any disadvantageous effects 
on E. coli during amplification of the expression cassette. Thus, 

15 for example, several consecutive codons which occur rarely in E. 
coli (for example the codons AGA and AGG, which encode arginine) 
can be inserted into the reading frame of the DSBI enzyme. This 
prevents expression in E • coli, but — owing to the different 
codon usage — continues to make possible expression in the 

20 plastids. As an alternative, promoters which are not active in E. 
coli, but are active in the plastids of higher plants can be used 
(for example promoters of the nuclear encoded plastidic RNA 
polymerases [NEP promoters; see hereinbelow] ) . A preferred method 
is the use of promoters which are recognized neither by the 

25 plastids nor by E. coli (for example viral promoters or 

bacteriophage promoters) and which only become functional by the 
simultaneous presence of the corresponding viral/bacteriophage 
RNA polymerase. Such methods are known to the skilled worker and 
described hereinbelow. Furthermore, it is feasible to destroy the 

30 relevant DSB recognition sequences in E. coli or to use a 

different host which has no DSB recognition sequences for the 
DSBI enzyme in question. Moreover, it is feasible and 
advantageous to have the coding region of the DSBI enzyme in 
promoterless form for amplification in E. coli. In this case, the 

35 sequence which encodes the DSBI enzyme is preferably present on a 
plasmid which is capable of integration into the plastidic genome 
of the plant to be transformed. Here, the integration site can be 
chosen in such a way that the gene encoding the DSBI enzyme comes 
under the control of a promoter which is naturally present in the 

40 plastome or has been inserted artificially into the plastome, 
thus resulting in expression of the DSBI enzyme in the plastids. 
A further preferred embodiment ensures that the gene encoding the 
DSBI enzyme can later be deleted from the plastome (see 
hereinbelow) , In addition, it is possible to create a linkage 

45 between a promoter and a DSBI enzyme by adding such a promoter in 
vitro upstream of the open reading frame by means of PCR 
techniques with which the skilled worker is familiar. The PCR 
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product can then be used for introduction into the plant 
plastids. Moreover, nonfunctional parts of an expression cassette 
for a DSBI enzyme can be generated and amplified in E. coli when 
these parts undergo recombination with one another after 
5 introduction into plant plastids (for example by means of 
homologous recombination in overlapping regions of the 
nonfunctional moieties of the expression cassette), thus giving 
rise to a functional expression cassette. 

10 "Recognition sequence for the directed induction of dna 

double-strand breaks" (hereinbelow "DSB recognition sequence" for 
double-strand break recognition sequence) generally refers to 
those sequences which permit recognition and cleavage by a DSBI 
enzyme under the conditions in the plastids of the plant cell or 

15 plant used in each case. Especially preferred are OSB recognition 
sequences for homing endonucleases which are encoded naturally in 
mitochondria or the nucleus of other organisms. Also, it is 
possible to use DSB recognition sequences of homing endonucleases 
which are derived from plastids (for example from green algae). 

20 Preferably, the DSB recognition sequence is singular in the 
plastidic DNA, i.e. a double-strand break is only generated at 
the location thus predefined. However, cases where more than one 
DSB recognition sequence is present in the plastome are also 
feasible. This is the case in particular when the DSB recognition 

25 sequence is localized in duplicated genes (for example in 

inverted repeats). In the latter case, more than one identical 
DSB recognition sequence exist, but their context is identical, 
so that, again, directed insertion takes place. Indeed, it is 
preferred that integration into all copies takes place, which 

30 also requires cleavage in all copies. DSB recognition sequences 
which, while occurring more than once in one plastome, are 
localized in the same plastomic context (for example in repeats 
or in gene duplications) come under the term "singular DSB 
recognition sequences" for the purposes of the present invention. 

35 

Preferably, the plant employed, or the cell derived therefrom, is 
predominantly homoplastomic or homotransplastomic with regard to 
the DSB recognition sequence, i.e. the predominant number of the 
plastidic DNA molecules present in the plastid contain this DSB 
40 recognition sequence. For the purposes of the present invention, 
such plants are also referred to as master plants. 



45 
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In principle, two types of DSB recognition sequences can be used: 

a) Natural, endogenous DSB recognition sequences 

5 As has been demonstrated within the scope of the present 

invention, the plastomes of higher plants comprise various 
sequences which can act as recognition sequences for DSBI 
enzymes (for example homing endonucieases) r even though no 
such endonucieases have been demonstrated in higher plants to 

10 date. Such DSB recognition sequences can be identified by 

screening the plastidic DNA sequence using the known DSB 
recognition sequences (for example those described in Table 
2 ) . The plastidic genome of various plants is known 
( http : / /megasun . bch . umontreal . ca/ogmp/pro j ects /other /cp_list . 

15 html). The sequences of the plastomes of the following have 

been reported: 

Arabidopsis thaliana < Sato S et al. (1999) DNA Res. 6 
(5):283-290) (GenBank Acc. No.: AP000423; NCBI Acc. No. 
20 NC_000932) 

Epifagus virginiana (Beechdrops; Wolfe KH et al. (1992) J 
Mol Evol 35(4) : 304-3 17; NCBI Acc. No.: NCJD01568? GenBank 
Acc. No.: M81884) 



25 



Lotus japonicus (Kato T et al. (2000) DNA Res 7(6):323- 
330? NCBI Acc. No.: NC_002694; GenBank Acc. No. : AP002983 ) 



Oryza sativa (rice) (Hiratsuka J et al. (1989) Mol Gen 
30 Genet 217 (2-3 ): 185-194 ; NCBI Acc. No.: NC_001320; GenBank 

Acc. -No: X15901), 

Marchantia polymorpha (Liverwort? Ohyama K et al. (1988) 
J Mol Biol 203(2) : 28 1-298; Yamano Y et al. (1984) Nucl 
35 Acids Res 12(11) : 4621-4624; GenBank Acc. No.: X04465 and 

Y00686; NCBI Acc. No.: NC_001319) 

- Nicotiana tabacum (tobacco) (GenBank Acc. No.: Z00044 and 
S54304; NCBI Acc. No.: NC_001879; Shinozaki K et al. 
40 (1986) EMB0 J 5:2043-2049) 

Oenothera elata ssp. hookeri (Monterey evening primrose; 
GenBank Acc. No.: AJ271079; NCBI Acc. No.: NCJ)02693; 
Hupfer H et al. (2000) Mol Gen Genet 263 ( 4 ) : 58 1-585) 



45 



Medicago truncatula (Gen Bank Acc. No.: AC093544) 
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PinuB thunbergii (black pine; Tsudzuki J et al. (1994) 
Curr Genet 26(2) :153-158; NCBI Acc. No. :NC_001631? 
GenBank Acc. No.: D17510) 

5 - Spinacia oleracea (GenBank Acc. No.: AJ400848 J01442 

M12028 M16873 M16878 M27308 M55297 X00795 X00797 X01724 
X04131 X04185 X05916 X06871) 

Triticum aestivum (wheat; GenBank Acc. No.: AB042240; 
10 NCBI Acc. No.: NC_002762) and 

Zea mays (GenBank Acc. No.: X86563; NCBI Acc. No.: 
NCJ)01666) 



15 In addition, further plastomes can be sequenced in order to 
identify DSB recognition sites therein. In general , it 
suffices to isolate highly-conserved regions from the 
plastorae by PCR methods with which the skilled worker is 
familiar and to sequence these regions only. 

20 

Furthermore, it is possible to determine natural, endogenous 
DSB recognition sites experimentally, for example by 
isolating the plastidic DNA (for example by the method of 
Mariac P et al. (2000) BioTechniques 28:110-113), amplifying 
25 the plastidic genome fragments to be taken into consideration 

by means of PCR or by using synthetic fragments and carrying 
out a restriction analysis with the DSBI enzyme in question. 
This restriction analysis is preferably carried out under 
conditions as they prevail in the plastid of a higher plant. 

30 

Moreover, the endogenous DSB recognition sequences for 
natural homing endonucleases which have been identified and 
described in Table 1 within the scope of the present 
invention are located in the conserved regions of the 

35 plastome so that — in particular taking into consideration 

the given variability, with regard to their respective 
recognition sequences, of the homing endonucleases mentioned 
in each case — it can be assumed that these recognition 
sequences are found virtually universally in all the 

40 plastomes of higher plants. The positions shown in Table 1 

reveal in each case the sequence stated and the 
reverse-complementary sequence, since all of the recognition 
regions shown in Table 1 are localized in the inverted 
repeat of the plastidic genome. Homing endonucleases which 

45 are especially preferred among those mentioned in Table 1 are 

I-Cpal, I-Ceul, I-Chul, I-CpaII and I-Crel. 
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The recognition sequences identified thus can be used for the 
insertion of foreign DNA by generating a double-strand break 
by introducing the corresponding DSBI enzyme. If the DSB 
recognition sequence were to be located in a highly-conserved 
region within a gene of the organelle genome, the foreign DNA 
is preferably inserted in the form of a self -splicing intron, 
which allows the reconstitution of the mRNA of the affected 
gene (see hereinbelow) ♦ 

The skilled worker is furthermore familiar with methods in 
which any endogenous sequence can act as recognition sequence 
for chimeric, mutated or artificial endonucleases, by 
subjecting their DNA recognition region to directed 
modification, for example by modification of a zinc finger 
domain fused to an endonuclease domain, or by modification of 
the RNA sequence of a group II intron RNA/protein complex 
(see hereinabove; WO 96/06166, Bibikova M et al. (2001) Mol 
Cell Biol 21:289-297). 
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Tab 1: Preferred endogenous cleavage sites in the plastidic 
genomes of tobacco, wheat, rice, maize and Arabidopsis. 
^complementary. Acc. No: GenBank Accession Number 
( http : / /www . ncbi . nlm. nih . gov/ ) . 

Singular cleavage sites of restriction endonucleases also 
exist in the plastidic genome. However, they are usually 
located in less highly-conserved regions and can therefore 
not necessarily be exploited universally in all plant 
species. The following may be mentioned by way of example: 

a) With the sequence GGCCTTTATGGCC the enzyme Sfil has a 
singular recognition site in the plastidic genome of 
Arabidopsis (GenBank Acc. No.: AP000423) at position 
40846-40858. F 

b) m the plastidic genome of maize (GenBank Acc. No.: 
X86563), there is a singular cleavage site for the enzyme 
ASCI at position 42130-42137, with the sequence GGCGCGCC. 

c) In the plastidic genome of rice (GenBank Acc. No.: 
X159019), there is a singular cleavage site for the 
enzyme Sgfl at position 77309-77316, with the sequence 
GCGATCGC, and for the enzyme AscI at position 39776-39783 
with the sequence GGCGCGCC. 

d) In the plastidic genome of tobacco (Accession Z00044), 
there is in each case a singular cleavage site for the 
enzyme Sfil at position 42475-42487, with the sequence 
GGCCTTTATGGCC, for the enzyme Sgrl at position 
78522-78529, with the sequence CACCGGCG, and for the 
enzyme Pmel at position 120895-120902, with the sequence 
GTTTAAAC . 

e) In the plastidic genome of wheat (Accession AB042240), 
there is in each case a singular cleavage site for the 
enzyme Pmel at position 59331-59338, with the sequence 
GTTTAAAC, a singular cleavage site for the enzymes Narl, 
Kanl, Ehel and Bbel at position 41438-41443, with the 
recognition sequence GGCGCC, and a recognition region for 
the enzyme Sfil at position 112656-112668, with the 
sequence GGCCCAGGGGGCC . 

All these plants with endogenous, natural DSB recognition 
sequences constitute, in a manner of speaking, naturally 
occurring master plants. In them, the DSB recognition 
sequence is naturally present in homoplastomic form. This 
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eliminates the need for the introduction and selection of 
artificial DSB recognition sequences. 

b) Artificially introduced DSB recognition sequences 

5 

The skilled worker realizes that the recognition region for a 
rare enzyme introduced into a master plant need not be of 
natural origin. In principle, any recognition sequence of any 
DSBI enzyme can be inserted at any position of the plastidic 

10 DNA. The preparation is preferably carried out using a 
construct for inserting the DSB recognition sequence 
(hereinbelow DSBR construct). Preferably, the DSBR construct 
comprises a selection marker to facilitate the selection of 
transplastomic plants with the successfully inserted DSB 

15 recognition sequence, which selection is required for 

generating suitable master plants. The skilled worker is 
familiar with a variety of selection markers which make 
possible selection of plastids (see hereinbelow). aadA, nptll 
or BADH are preferred, with aadA being especially preferred. 

20 Selection is carried out for example with the aid of the 

"segregation and sorting" process, with which the skilled 
worker is familiar (described by way of example in Example 
4). The selection marker is preferably constructed in such a 
way that a subsequent deletion from the plastome is made 

25 possible. Such methods are known to the skilled worker and 

described hereinbelow. 

Thus, it is preferred first to generate a plant which is 
homotransplastomic with regard to the inserted DSB 
30 recognition sequence and which has a DSB recognition sequence 

in all or the predominant number of the plastids of the plant 
in question. Such plants can advantageously be employed as 
master plants. 

35 In addition to the selection marker, the DSBR construct may 

comprise further sequences. These may contain for example 
further regulatory elements for the expression of the 
insertion sequences to be introduced subsequently. In a 
preferred embodiment, the selection marker introduced within 

40 the construct for insertion of the DSB recognition sequence 

is deleted after obtaining the homoplastomic master plant by 
methods known to the skilled worker (see hereinbelow). 

In a preferred embodiment, the DSBR construct comprises, for 
45 making possible a site-specific insertion, further flanking 

sequences at at least one, preferably at both sides of the 
DSB recognition sequence, which flanking sequences have 
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sufficient length and homology with corresponding target 
sequences in the plastome to ensure site-specific insertion 
by means of homologous recombination. 

5 Owing to the large number of the DSBI enzymes with defined 

recognition sequences which have been described in the prior art, 
it is possible , and preferred, to generate master plants which 
have a plurality of different singular DSB recognition sequences 
incorporated into their plastidic genome. 

0 

The recognition sequences for the respective DSBI enzymes listed 
are mentioned hereinbelow in Table 2 by way of example, but not 
by limitation. 



15 Table 2: Recognition sequences and source organism of the DSBI 
enzymes (" A " shows the cleavage site of the DSBI enzyme within a 
recognition sequence . ) 



30 



40 



DSBI 
enzyme 


Source 
organism 


Recognition sequence 


I -Anil 


Aspergillus 
nidulans 


5 ' -TTGAGGAGGTT * TCTCTGTAAATAANNNNNNNNNNNNNNN 
3 '-AACTCCTCCAAAGAGACATTTATTNinraNNNNNNNNNNN A 


I-Cvul 


Cnlorella 
vulgaris 


5 ' -CTGGGTTC AAAACGTCGTGA A GACAGTTTGG 
3 ' -GACCCAAGTTTTGCAG A CACTCTGTCAAACC 


I-CsmI 


Chlamydomonas 
smithii 


5 ' - GTAC T AGCATGGGGTCAAATGTCTTTCTGG 


I-Cmoel 


C hi amydomo nas 
moewusii 


5 ' -TCGTAGCAGCT A C ACGGTT 
3 ' -AGC ATCG A TCGAGTGCCAA 


I-Crel 


Chlamydomonas 
reinhardtii 


5 ' -CTGGGTTCAAAACGTCGTGA* GACAGTTTGG 
3 ' -GACCCAAGTTTTGC AG A CACTCTGTCAAACC 


I-Chul 


Chlamydomonas 
humicola 


5 ' -GAAGGTTTGGCACCTCG^ATGTCGGCTCATC 
3 ' -CTTCCAAACCGTG A GAGCTACAGCCGAGTA 


I-Cpal 


Chlamydomonas 

pallido- 

stigmatica 


5 ' -CGATCCTAAGGTAGCGAA* ATTCA 
3 '-GCTAGGATTCCATC A GCTTTAAGT 


I-CpaII 


Chlamydomonas 

pallido- 

stigmatica 


5 '-CCCGGCTAACTC^TGTGCCAG 
3 '-GGGCCGAT A TGAGACACGGTC 


I-Ceul 


Chlamydomonas 
eugametos 


5 ' -CGTAACTATAACGGTCCTAA A GGTAGCGAA 
3 • -GC ATTGATATTGCCAG A GATTCCATCGCTT 


I-Dmol 


Desulfuro- 

coccus 

mobilis 


5 ' -ATGCCTTGCCGGGTAA^GTTCCGGCGCGCAT 
3 '-TACGGAACGGCC A CATTCAAGGCCGCGCGTA 


I-Scel 


Saccharomyces 
cerevisiae 


5 '-AGTTACGCTAGGGATAA*CAGGGTAATATAG 
3 ' — TCAATGCGATCCC A TATTGTCCCATTATATC 
5 ' -TAGGGATAA A CAGGGTAAT 

3 ' -ATCCC^TATTGTCCCATTA ("Core" sequence) 


I-Scell 


Saccharomyces 
cerevisiae 


5 ■ -TTTTGATTCTTTGGTCACCC A TGAAGTATA 
3 ' - AAAACT AAGAAACC AG A TGGGACTTC AT AT 


I-Scelll 


Saccharomyces 
cerevisiae 


5 ' - ATTGGAGGTTTTGGTAAC A TATTTATTACC 
3 ' -TAACCTCCAAAACC A ATTGATAAAT AATGG 



0817/00028 



CA 02470329 2004-06-14 



36 





DSBI 
enzyme 


Source 
organism 


Recognition sequence 




l*Od>Bl V 


Saccharomyces 
cerevisiae 


5 
3 


' -TCTTTTCTCTTGATTA A GCCCTAATCTACG 
' - AG AAAAGAG AAC ~ TAATCGGG ATT AG ATGC 


5 




Saccharomyces 
cerevisiae 


5 
3 


' -AATAATTTTCT * TCTT AGTAATGCC 
' -TTATT AAAAGAAGAATC ATTA A CGG 




T Cs*At7T 

I-bCevi 


Saccharomyces 
cerevisiae 


5 
3 


' -GTTATTTAATG * TTTTAGTAGTTGG 
' -CAATAAATTAC AAAATCATC A * ACC 


10 


i-scevxi 


Saccharomyces 
cerevisiae 


5 


' -TGTCACATTGAGGTGCACTAGTTATTAC 


PI-SceI 


Saccharomyces 
cerevisiae 


5 
3 


' -ATCTATGTCGGGTGC A GGAGAAAGAGGTAAT 
' -TAGATACAGCC *CACGCCTCTTTCTCCATTA 




F-Scel 


Saccharomyces 
cerevisiae 


5 
3 


' -GATGCTGTAGGC * ATAGGCTTGGTT 
' -CTACGACA" TCCGTATCCGAACCAA 


15 


r-sceii 


Saccharomyces 
cerevisiae 


5 
3 


' - C TTTCCGC AACA * GT AAAATT 
' -GAAAGGCG" TTGTCATTTTAA 




I-Llal 


Lactococcus 
lactis 


5 
3 


' — CACATCCATAAC^CATATCATTTTT 
' -GTGTAGGTATTGGTAT AGTAA* AAA 




I-Msol 


Monoroastix 
species 


5 
3 


' -CTGGGTTCAAAACGTCGTGA* GACAGTTTGG 
' -G ACCC AAGTTTTGC AG A C ACTCTGTC AAACC 


20 


I-NanI 


tiaegleria 
andersoni 


5 
3 


f -AAGTCTGGTGCCA * GC ACCCGC 
' -TTCAGACC * ACGGTCGTGGGCG 




I-NitI 


Naegleria 
italica 


5 
3 


' - AAGTCTGGTGCC A ~GC ACCCGC 
' —TTCAGACC A ACGGTCGTGGGCG 




I-Njal 


Naegleria 
jamiesoni 


5 
3 


' -AAGTCTGGTGCCA A GCACCCGC 
' -TTCAGACC ~ ACGGTCGTGGGCG 


25 


I-PakI 


Pseudendo- 

clonium 

aJcinetum 


5 
3 


1 -CTGGGTTCAAAACGTCGTGA* GACAGTTTGG 
' -GACCCAAGTTTTGCAG A CACTCTGTCAAACC 




I-Porl 


Pyrobaculum 
organotrophum 


5 
3 


' -GCGAGCCCGT AAGGGT * GTGT ACGGG 
' -CGCTCGGGCATT'CCCACACATGCCC 


30 


I-Ppol 


Phys arum 
polycephalum 


5 ' -TAACTATGACTCTCTTAA A GGT AGCCAAAT 
3 ' -ATTGAT ACTGAGAG* AATTCC ATCGGTTT A 

"Core sequence": 

CTCTCTTAA A GGTAGC 
GAG AG * AATTCC ATCG 


35 


X-dCai. 


Saccharomyces 
capensis 


5 
3 


' -TGTC ACATTGAGGTGCACT * AGTTATTAC 
• - ACAGTGTAACTCCAC A GTGATC AATAATG 


I-Ssp6803I 


Synechocystis 
species 


5 
3 


' -GTCGGGCT " CATAACCCGAA 
' -CAGC CCG AGT A A TTGGGCTT 




Pl-Pful 


Pyrococcus 
furiosus Vcl 


5 
3 


' -GAAGATGGGAGGAGGG ~ ACCGGACTCAACTT 
' -CTTCTACCCTCC A TCCCTGGCCTGAGTTGAA 


40 


pi-pfuir 


Pvrococcus 
furiosus Vcl 


5 
3 


' -ACGAATCCATGTGGAGA A AGAGCCTCTATA 
' -TGCTTAGGTACAC * CTCTTCTCGGAGATAT 




Pl-Pkol 


Pyrococcus 

kodakaraensis 

K0D1 


5 
3 


' -GATTTTAGAT A CCCTGTACC 
' -CTAAAA A TCT AGGGAC ATGG 


45 


PI-PkoII 


Pyrococcus 

kodakaraensis 

K0D1 


5 
3 


' -C AGT ACTACG A GTTAC 
' -GTC ATG A ATGCCAATG 




PI-PspI 


Pyrococcus 
sp. 


5 
3 


' - AAAATCCTGGC AAACAGCT ATTAT A GGGTAT 
' -TTTT AGGAC C GTTTGTC GAT A AAT ACCCATA 
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DSBI 
enzyme 


Source 
organism 


Recognition sequence 


PI-Tful 


Thermococcus 

fumicolans 

ST557 


5 ' - T AG ATTTTAGGT a CGCTAT ATCCTTCC 
3 ' - ATCTAAAA* TCCAGCGATATAGGAAGG 


PI-TfuII 


Thermococcus 

fumicolans 

ST557 


5 ' -TAYGCNGAYACN A GACGGYTTYT 
3 ' — ATRCGNCT HTfytof*n*nnn-a r * -n * 


fi-Tnyi 


Thermococcus 

hydro- 

thermalis 


*' iwivaujJUjAiALW wACGGYTTYT 
3 '-ATRCGNCT~RTGNCTGCCRAARA 


PI-Tlil 


Thermococcus 
litoralis 


5 ' -TAYGCNGAY ACNGACGG ~ YTTYT " " 

3 ' -ATRCGNCTRTGNC A TGCCRAARA 


PI-Tlill 


Thermococcus 
litoralis 


5 ' -AAATTGCTTGCAAACAGCTATTACGGCTAT 


I-TevI 


T4 


5 ' - AGTGGTATCAAC * GCTCAGTAGATG 
3 ' -TCACCATAGT* TGCGAGTC ATCTAC 


I-Tevll 


Bacteriophage 
T4 


5 ' — GCTTATGAGTATGAAGTGAACACGT * TATTC 
3 ' -CGAAT ACTCATACTTC ACTTGTG * C AATAAG 


F-TevI 


Bacteriophage 
T4 


5 ' -GAAACACAAGA* AATGTTTAGTAAANNNNNNNNNNNNNN 
3 ' — CTTTGTGTTCTTTACAAATCATTTNNNNNNNNNNNNNN A 


F-TevII 


Bacteriophage 
T4 


5 • -TTTAATCCTCGCTTC * AGATATGGCAACTG ~ ~ 

3 ' - AAATTAGGAGCGA* AGTCTATACC GTTGAC 



10 



15 



20 



30 



Also comprised are deviations (degenerations) of the recognition 
25 sequence which nevertheless continue to make possible recognition 
and cleavage by the DSBI enzyme in question. Such deviations 
- also in connection with different framework conditions such as, 
for example, calcium or magnesium concentrations - have been 
described (Argast GM et al. (1998) J Mol Biol 280: 345-353). 
Furthermore comprised are core sequences of these recognition 
sequences. It is known that the inner portions of the recognition 
sequences also suffice for an induced double-strand break and 
that the outer portions are not necessarily relevant, but may 
have an effect on the cleavage efficiency. Thus, for example, an 
18 bp core sequence can be defined for I-Scel. The term "DSB 
recognition sequence" thus also comprises all essentially 
identical recognition sequences. Essentially identical 
recognition sequences refers to those recognition sequences 
which, while deviating from the recognition sequence identified 
40 as being optimal for the enzyme in question, still permit 
cleavage by the same. 



35 



Various localization sites (in the case of already existing 
endogenous DSB recognition sequences) or integration sites (in 
45 the case of artificially generated DSB recognition sequences) are 
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possible for the DSB recognition sequence. Examples which may be 
mentioned are: 

a) Localization (integration) in a transcriptionally silent 
5 region 

Localization (integration) of the DSB recognition sequence in 
a transcriptionally silent region of the plastidic genome 
(intergenic region) is the preferred embodiment, in this 
10 manner, an adverse effect on the plastids' functions can be 

largely ruled out- In this context, it must be noted that, if 
appropriate, suitable regulatory elements such as promoters 
and the like must also be introduced for expression to take 
place. 

IS 

b) Localization (integration) in a transcriptionally active but 
noncoding ( intercistronic ) region 

The advantage of this localization (integration) is that the 
20 insertion sequence to be introduced is thereby ultimately 

encoded in a plastidic operon and promoter (s) /terminator (s) 
need not be introduced separately, but those present 
endogenously at this locus can, but do not have to, be 
utilized. In such a case, only ribosome binding sites should 
25 be present at a suitable distance upstream of the coding 

region of the foreign genes to be introduced. 

However, it is also feasible that an intergenic region is not 
entirely transcriptionally silent , for example because 
30 transcriptional termination from an adjacent gene or operon 
is only inefficient. 

c) Localization (integration) in a transcriptionally active 
coding region. 

35 

The localization (integration) described under a) and b) of 
the DSB recognition sequence at a noncoding locus has the 
advantage that the insertion of the foreign DNA is highly 
likely not to affect the function of the plastidic genome. 

40 However, noncoding regions are less well conserved than 

coding regions. In order to have available as universal a 
method as possible which works in many plant species, the DSB 
recognition sequence (and therefore the insertion sequence) 
is, in an especially preferred embodiment, localized in the 

45 coding sequence of an existing gene. Destruction of the gene 

function by introducing the DSB recognition sequence (in the 
case of an artificially generated DSB recognition sequence), 
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or the introduction of the insertion sequence, is prevented 
inventively, in a preferred variant of this embodiment, by 
introducing the DSB recognition sequence, or the insertion 
sequence, within an intron. in this manner, the complete 
coding mRNA is regenerated at the site of integration by 
splicing the pre-RNA of the gene. 



DSB recognition sequences which do not occur naturally in the 
plastidic DNA can be introduced into the plastidic DNA in various 
10 ways. Examples which may be mentioned are: 



a) Integration by means of double crossover 



Integration into the plastidic genome is preferably carried 
15 out with the aid of the above-described methods with which 

the skilled worker is generally familiar (double crossover). 

b) Integration using natural, endogenous DSB recognition 
sequences 

20 

c) integration using recombinases and corresponding recognition 
sequences . 



Even though the procedure for inserting an artifical DSB 
25 recognition sequence into the plastidic DNA is relatively 
complicated and, in case a), corresponds to the plastid 
transformation method currently described in the prior art, this 
complicated procedure only has to be carried out once. The 
resulting homotransplastomic master plant can then be employed 
30 for any number of different subsequent transformations using the 
method according to the invention, which makes possible a 
substantial increase in the transformation efficiency: instead of 
having to carry out the conventional selection process for a 
homotransplastomic plant every single time, it only has to be 
35 carried out once in the present context. 

"Deactivation of the functionality" of a DSB recognition sequence 
means that, owing to insertion of the insertion sequence at or 
near the position of the double-strand break, the DSB recognition 
40 sequence is destroyed, i.e. the corresponding DSBI enzyme no 
longer recognizes the region and, accordingly, no longer induces 
a double-strand break at this position. 



Construction of the transformation construct with the insertion 
45 sequence 
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Using one of the above-described master plants or cells derived 
from them which contain a natural and/or an artificially 
generated DSB recognition sequence in the plastome, the insertion 
sequence is inserted into said DSB recognition sequence within a 
5 transformation process. This is effected with the simultaneous 
presence of a DSBI enzyme, which recognizes one of the DSB 
recognition sequences in the plastome. 

In its simplest form, the transformation construct consists only 
10 of the insertion sequence itself, for example of an expression 
cassette which is to ensure the expression of a certain gene in 
the plastids. The sequence-specific induction of double-strand 
breaks suffices to ensure that this insertion sequence is placed 
at this position and thus to bring about the deactivation of the 
15 DSB recognition sequence. 

In a preferred embodiment, the insertion sequence comprises at 
least one nucleic acid sequence to be expressed. To ensure 
expression (transcription and/or translation), they axe to be 

20 provided with regulatory elements, depending on the embodiment 
and the insertion site. If insertion takes place at a 
transcriptionally active locus, no promoter sequences are 
required, as described above. The sequences to be expressed are 
advantageously provided in any case with ribosome binding sites 

25 at a suitable distance upstream of the open reading frame, or are 
already equipped naturally with such sites. These regulatory 
sequences or parts thereof can, however, also be present 
naturally in the plastome or introduced into the plastidic DNA 
together with the DSB recognition sequence as early as in the 

30 first step, i.e. in the generation of a nonnatural master plant. 

An increase of the insertion efficiency and insertion accuracy 
can be brought about by flanking the insertion sequence present 
in the transformation construct and the DSB recognition sequence 

35 by homologous sequence regions which, owing to the induced 
double-strand break, ensure homologous recombination. In a 
preferred embodiment, the insertion sequence comprises flanking 
homology sequences A' and B', the sequence to be introduced into 
the plastidic DNA being located between A' and B'. The DSB 

40 recognition sequence is flanked by homology sequences A and B, 
respectively, the DSB recognition sequence being located between 
A and B. A and B can be of natural origin or have been introduced 
in context with the insertion of nonnatural DSB recognition 
sequences. A and A' and B and B', respectively , are sufficiently 

45 long and sufficiently homologous to one another to ensure a 
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homologous recombination between A and A', and B and B' 
respectively. ' 

in a further embodiment, the DSB recognition sequence is flanked 
5 merely by a homologous sequence A which has sufficient homology 
to a sequence A' which flanks the insertion sequence 
unilaterally. 

With regard to the homology sequences, -sufficient length* 
10 preferably means sequences with a length of at least 20 base 

ll L V' 17 at - lea8t 50 bSSe PairS ' ••P~i*"y Preferably 

at least 100 base pairs, very especially preferably at least 250 
base pairs, most preferably at least 500 base pairs. 

15 With regard to the homology sequences A and A', and B and B' 
respectively -sufficient homology- preferably means sequences 
which have at least 70%, preferably 80%, by preference at least 
90%, especially preferably at least 95%, very especially 

™ ^^f^ ^ l6aSt " % ' P refer «bly 100% homology within 

20 these homology sequences over a length of at least 20 base pairs 
preferably at least 50 base pairs, especially preferably at leasi 
100 base pairs, very especially preferably at least 250 base 
pairs, most preferably at least 500 base pairs. 

25 Homology between two nucleic acids is understood as meaning the 
identity of the nucleic acid sequence over in each case the 
entire sequence length which is calculated by alignment with the 
aid of the program algorithm GAP (Wisconsin Package Version 10 o 

in !r mty0f w «consin, Genetics Computer Group (GCG) , Madison,' 

30 USA), setting the following parameters: 

Gap Weight: 12 Length Weightj 4 

^ Average Match: 2,912 Average Mi8match{ _ 2f003 

Since homologous recombination is promoted by the induced 
double-strand break, the requirements regarding length and 
homology of the sequences are markedly less than is the case for 
example m the case of conventional homologous recombination. In 
40 this context, the homologous regions can also be markedly smaller 
than 250 bp. The advantage of using homology sequences is that, 
when A' and B' are different, or when only one homology sequence 

LT^? US ^' 3 dlreCted insertion of «• insertion sequence 
into the plastidic DNA can take place. 

45 
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The transformation construct or the insertion sequence preferably 
comprises a selection marker which makes possible the selection 
of transplastomic plastids (see hereinbelow) , especially 
preferably aadA, BADH or a binding-type marker. The selection 
5 marker is preferably constructed in such a way that subsequent 
deletion from the plastome is made possible. Such methods are 
known to the skilled worker and described hereinbelow. 

The insertion sequence or the transformation construct preferably 
10 has the structure and sequence of an intron. As a rule, the 
naturally occurring introns are modified in such a manner for 
this purpose that they meet the requirements of the method 
according to the invention. Such artificial introns are 
especially preferred when they are to be inserted into a 
15 transcriptionally active or even coding region, for example, 

using a natural, endogenous DSB recognition sequence. Preferably, 
insertion takes place in such a way that the inserted sequence is 
removed completely by splicing the pre-mRNA. The RNA which has 
been spliced out (that is to say the artificial intron) now 
20 constitutes the mRNA, for example for the translation of proteins 
encoded on it. This method has further advantages: 

The introns utilized show pronounced secondary folding so 
that a relatively stable RNA results • The genes of interest 
25 which are encoded in the intron can therefore be expressed at 
a particularly high level, as has been demonstrated, for 
example, in E. coli (Chan KYW et al. (1988) Gene 73:295-304). 

When the intron is integrated into a gene, the transcription 

30 of the intron is subject to the regulatory control of the 

gene into which the intron has been integrated. This is why 
all regulatory elements upstream or downstream of the gene(s) 
of interest can be dispensed with in the intron. The 
constructs can thus be kept correspondingly small, and it is 

35 certain that transcription does indeed work, including in the 

species under investigation. The utilization of heterologous 
regulatory elements involves the residual risk that these 
elements are not functional in the investigated plastids of 
the plant species in question. The utilization of homologous 

40 sequences can, owing to the sequence duplication, lead to 

spontaneous recombination events with the endogenous 
sequences and thus to instability of the organelle genome. 
Owing to the possibility of largely being able to dispense 
with the introduction of regulatory elements — for example by 

45 encoding the gene of interest in an intron which is inserted 

into a transcriptionally active plastome region — many other 
disadvantages of conventional plastid transformation can be 
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avoided with the method according to the invention in this 
embodiment, in addition to increasing the insertion and 
distribution ability of the transformation constructs. 

5 Moreover, all introns can be used when the relevant factors which 
mediate splicing are simultaneously expressed in the plastids or 
imported into them. Preferably, the splicing factors are encoded 
in the intron itself. Group n introns, which themselves encode 

in tt t ° f thS SpllCe factors - «e especially preferred in 
10 this embodiment. They include the Lactococcus Ll.ltrB intron 

Likewise preferred introns are those which naturally occur in the 
plastids of higher plants, especially group II introns, very 
especially preferably introns which encode a protein, most 
preferably introns of the trnK genes of the plastidic genome, m 
15 the latter case, the introns from the trnK genes of the plastids 
from the species Arabidopsis, maize and tobacco are especiallv 
preferred. ^ y 

Preferred introns are those which have a self -splicing activity 
20 which does not depend on further protein factors, or introns 

which utilize general factors for splicing which are universally 
present, and therefore also in plastids, and also introns which 
themselves encode factors required for splicing. These introns 
include, for example, 

25 

a) the group I intron from Tetrahymena (GenBank Acc. No • 

X54512; Kruger K et al. (1982) Cell 31:147-157; Roman J and 
Woodson SA (1998) Proc Natl Acad Sci USA 95:2134-2139) 

30 b) the group II rll intron from Scenedesmus obliguus (GenBank 
Acc. No.: X17375.2 nucleotides 28831 to 29438; Hollander V 
and Kiick U (1999) Nucl Acids Res 27: 2339-2344; Herdenberger 
P et al. (1994) Nucl Acids Res 22: 2869-2875; Kiick U et al 
(1990) Nucl Acids Res 18:2691-2697) 

35 

c) the Ll.LtrB intron (GenBank Acc. No.: U50902 nucleotides 2854 
to 5345) 

d) the Arabidopsis trnK intron (GenBank Acc. No.: AP000423, 
40 complementary nucleotides 1752 to 4310) 

e) the maize trnK intron (GenBank Acc. No.: X86563, 
complementary nucleotides 1421 to 3909) 

45 f) the tobacco trnK intron (GenBank Acc. No.: Z00044, 
complementary nucleotides 1752 to 4310). 
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Not only heterologous introns, but also introns which naturally 
occur in the plastids of the plant in question can be utilized. 
Heterologous introns — for example heterologous trnK introns — 
are preferred to avoid instabilities brought about by sequence 
5 duplication. In a preferred embodiment , introns which occur 
naturally in the plastids of the plant in question are modified 
in such a way that they have a sequence homology of less than 
95%, preferably 80%, especially preferably 70% with the sequence 
of the starting intron, while still being able to retain their 
10 function. 

In a further preferred embodiment , a factor which brings about 
splicing of the intron in question is available in trans, i.e. it 
is not encoded in the intron itself. If this factor is not 

15 naturally present in the plastid in question, but first has to be 
introduced into it, such a procedure can be effected in various 
ways with which the skilled worker is familiar. Examples which 
may be mentioned are the introduction of a suitable coding 
sequence, which is capable of expression, into the plastome or 

20 the introduction into the nuclear DNA; in the latter case, the 
factor is preferably fused with a PLS. 

Especially preferred introns are those which naturally encode a 
DSB enzyme (in particular a homing endonuclease) . Especially 
25 preferred is the intron Cp.LSU2 from Chlamydomonas 

pallidostigmatica, which encodes the enzyme I-Cpal (Turmel M et 
al. (1995) Mol Biol Evol 12:533-545). Also preferred are the 
group II introns from yeast mitochondria. 

30 In a preferred embodiment, the intron sequence is adapted to suit 
the insert site so that they can splice at this locus. In the 
case of group I introns, this adaptation can relate to the 
internal guide sequence (IGS) and in the case of the group II 
introns the exon binding sequence (BBS) I and/or II. 

35 

In the case of the maize trnK intron, it must be noted that the 
protein encoded by the trnK intron, which also comprises the 
maturase function, is probably not functional in its naturally 
encoded form without editing. It has been demonstrated that 

40 editing (His420Tyr) of the corresponding mRNA takes place in 
barley plastids (Vogel J et al. (1997) J Mol Biol 270:179-187). 
Tyrosine at position 420 of the matK protein is highly conserved, 
in the monocots rice and maize, too, a codon encoding His has 
been found at the corresponding position in the coding DNA. It 

45 can therefore be assumed that the matK transcript is also edited 
in those plants, as is the case in barley. Since, however, other 
plant species may, if appropriate, not be able to provide such 
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RNA editing, a preferred embodiment provides that the matK gene 
in the maize trnK intron is already modified at DNA level by a 
suitable His/Tyr substitution, so that RNA editing is no longer 
required. For example, the sequence CATTATCATAGTGGAT of the maize 
5 trnK intron can be mutated into CATTATTATAGTGGAT • 

in the case of group I introns, the splicing site is determined 
by the pairing of IGS with the exon of the corresponding 
transcript, which exon is located 5' and/or 3' relative to the 

10 intron (Lambowitz AM & Belfort M (1993) Annu Rev Biochem 

62:587-622). Using techniques which are known to the skilled 
worker, such as PCR or the synthetic generation of nucleotide 
sequences, the IGS can be matched to any group I introns in such 
a way that splicing takes place at the predefined insertion site 

15 within the DSB recognition region. The modified IGS is designed 
in such a way that it can undergo — at least partial — base 
pairing with the sequences of the transcript 5' and 3' of the 
insertion site. The C. pallidostigmatica CpLSU2 intron, which 
encodes the homing endonuclease I-Cpal, is preferably utilized. 

20 If this intron is utilized in connection with the expression of 
the DSBI enzyme I-Cpal, whereby insertion of the DNA to be 
transformed into the 23S rDNA of the plastidic genome of higher 
plants results, no adaptation of the intron is necessary. 
Insertion takes place at a locus in the plastidic genome of 

25 higher plants which is homologous to the locus at which the 

intron is naturally present in C. pallidostigmatica. This intron 
is therefore already designed in such a way that pairings with 
the 5' and 3' exon can be undergone and that correct splicing in 
this nucleotide environment takes place. Furthermore preferred is 

30 the group I intron from Tetrahymena thermophila, where, as a 
413 bp intervening sequence (IVS), it interrupts the 26S rRNA 
coding region (Accession V01416 J01235 nucleotides 53 to 465). 
The IGS with the sequence 5'-ggaggg-3' which can be found natu- 
rally (Waring RB et al. 1985 Cell 40: 371-380; Been, MD & Cech, 

35 TR 1986 Cell 47: 207-216) can be adapted to the new insertion 

site by techniques with which the skilled worker is familiar. If, 
for example, integration into the DSB recognition site of 
the I-Cpal enzyme at the position identified by * (cggtcct A aagg- 
tagcgaaattc) is desired, the mutated, adapted IGS can, for exam- 

40 pie, have the following sequence: 5 ' -gggacc-3 ' . 

In group II introns, which are mobile, further activities in 
addition to maturase are frequently encoded in the protein moiety 
of the ribonucleoprotein complex. However, these are not 
45 necessarily required for the method described and can therefore 
be deleted. Indeed, deletion is preferred since it makes the 
construct in question smaller and easier to handle. The skilled 
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worker is familiar with a variety of options for removing such 
activities from the protein moiety. For example, this can be 
effected by generating a synthetic gene which comprises only the 
desired regions, or by suitable PCR methods. 

5 

Self-splicing group II introns have a conserved structure and 
generally consist of 6 different domains. Domain I comprises the 
exon binding sites (EBS1 and EBS2) which, during the splicing 
procedure, interact with the exon located 5' from the intron. In 

10 addition, an interaction between the "6 region" (located 

immediately 5' of EBS1) and the "6 r region" at the 3' exon takes 
place (Lambowitz AM & Belfort M (1993) Annu Rev Biochem 
62:587-622; Michel F & Ferat JL (1995) Annu Rev Biochem 
64:435-461). These sequences can be adapted by techniques with 

15 which the skilled worker is familiar, such as synthetic 

generation of the introns or suitable PCR methods, in each case 
in such a way that correct choice of the splicing sites at the 
insert site chosen in the DSB recognition region is ensured. This 
is done in such a way that the regions mentioned are modified so 

20 that base pairings with the corresponding sequences upstream 
(intron binding sequences, IBS) and downstream (6') of the 
artificial insertion sequence can be undergone. If, for example, 
cggtcctaaggt'agcgaaattc is chosen as insertion site ( A ) for the 
Ll.LtrB intron in the I-Cpal recognition region, the 6 region and 

25 the EBS1 region can, for example, adopt the sequence TCGCTACCTTAG 
(natural sequence: TTATGGTTGTG ) , and EBS2 for example the 
sequence GACCG (natural sequence: ATGTG) . If the Arabidopsis 
thaliana trnK intron is selected, the 5 region and the EBS1 region 
can, for example, adopt the sequence CGC TACCTTAGG (natural 

30 sequence: AATGTTAAAAA) , assuming the same insertion site as 
indicated for the Ll.LtrB intron. 

If the DSB recognition sequence takes the form of a natural, 
endogenous recognition sequence of a homing endonuclease , a 
35 selected intron is preferably inserted at the site of the DSB 
recognition region at which the intron belonging to the homing 
endonuclease in question can also be found naturally. 

The artificial insertion site of an intron in the DSB recognition 
40 site is preferably chosen such that 5' and 3' of the intron 
inserted as many bases as possible correspond to those of the 
natural insertion site of the intron in question and that the DSB 
recognition sequence is no longer functional after insertion of 
the intron. Very especially preferably, the nucleotide located in 
45 each case immediately upstream or downstream of the insert site 
of the intron corresponds to that at the natural insertion site. 
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In an especially preferred embodiment, the intron is flanked by 
homology sequences in order to make possible a directed * 
insertion. Here, the homology sequences are - as described above 
- homologous to the sequences flanking the DSB recognition 
5 sequence and thus make possible an accurate insertion. 

The invention therefore furthermore also relates to DNA 
constructs comprising at least one nucleic acid and intron 
sequence elements which are capable of ensuring, in a ribonucleic 
10 acid sequence derived from said DNA construct, the deletion of 
the ribonucleic acid fragment encoding said nucleic acid 
sequence, where said nucleic acid sequence is heterologous with 
regard to said intron sequence elements. 

15 In a preferred embodiment, the nucleic acid sequence is flanked 
at least by a splice acceptor sequence and a splice donor 
sequence. 

In a further embodiment, the DNA construct comprises, at the 5' 
20 and the 3' end, sequences HI and H2, respectively, which have 
sufficient length and homology with plaatid sequences HI' and 
H2 ' , respectively, to ensure homologous recombination between Hi 
and HI', and H2 and H2', respectively, and thus insertion of the 
HI- and H2-flanked sequence into the plastome. 

25 

The invention furthermore relates to a transgenic plastidic DNA 
comprising at least one nucleic acid sequence and intron sequence 
elements which are capable of ensuring, in a ribonucleic acid 
sequence derived from said transgenic plastidic DNA, the deletion 
30 of said ribonucleic acid fragment encoding said nucleic acid 
sequence, where said nucleic acid sequence is heterologous with 
regard to said intron sequence elements, in a preferred 
embodiment, the nucleic acid sequence is flanked by at least one 
splice acceptor sequence and one splice donor sequence. 

35 

To construct a transformation vector, the insertion sequence or 
the transformation construct can be cloned into a standard vector 
such as pBluescript or pUC18. in a preferred embodiment, the 
insertion sequence or the transformation construct is applied as 
40 a linear or linearized DNA molecule. 

Preferably, only the portion of transformation vector which 
comprises the insertion sequence or the transformation construct 
with, if appropriate, homology sequences, selection marker and/or 
45 the expression cassette for the DSBI enzyme is applied. If all or 
some of the homology sequences are dispensed with, the linearized 
DNA molecule is preferably obtained by digestion with restriction 
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endonucleases which generate single-stranded DNA overhangs at one 
or at both ends which are compatible with those generated by the 
DSBI enzyme in the plastidic DNA. 

5 In a preferred embodiment , the transformation vector can comprise 
elements (for example a plastidic ORI {origin of replication)) , 
which make it possible for the vector autonomously to replicate 
in the plastid or stably to exist in the plastids as 
extrachromosomal DNA molecule, before being integrated into the 
10 plastidic DNA. Such methods are known to the skilled worker 
(US 5,693,507; US 5,932,479; WO 99/10513). This method is 
preferred since it increases the copy number of the insertion 
sequences which is available for integration in the plastid. 

15 One of the above-described constructs can be introduced into the 
plastids of a suitable master plant using one of the methods 
described. Microinjection is preferred, particle bombardment is 
particularly preferred. 

20 Cloning, expression, selection and transformation methods 

"Expression cassette" means — for example regarding the 
expression cassette for the DSBI enzyme - those constructions in 
which the DNA to be expressed is in operable linkage with at 

25 least one genetic control element which makes possible or 

regulates its expression (i.e. transcription and/or translation). 
In this context, expression can be, for example, stable or 
transient, constitutive or inducible. A variety of direct methods 
(for example transf ection, particle bombardment, microinjection) 

30 or indirect methods (for example agrobacterial infection, viral 
infection) Btated hereinbelow is available to the skilled worker 
for the introduction, and these methods will be stated 
hereinbelow. 

35 An operable linkage is generally understood as meaning an 

arrangement in which a genetic control sequence can exert its 
function with regard to a nucleic acid sequence, for example 
encoding a DSBI enzyme. Function, in this context, can mean for 
example the control of the expression, i.e. transcription and/or 

40 translation, of the nucleic acid sequence, for example encoding a 
DSBI enzyme. Control, in this context, comprises for example 
initiating, increasing, governing or suppressing the expression, 
i.e. transcription and, if appropriate, translation. Governing, 
in turn, can be tissue and/or timing-specific. It may also be 

45 inducible, for example by certain chemicals, stress, pathogens 
and the like. 
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An operable linkage is understood as meaning for example the 
sequential arrangement of a promoter, of the nucleic acid 
sequence to be expressed - for example encoding a DSBI enzyme - 
and, if appropriate, further regulatory elements such as, for 
5 example, a terminator in such a way that each of the regulatory 
elements can fulfill its function upon expression of the nucleic 
acid sequence, for example encoding a DSBI enzyme. In this 
context, operable linkage need not necessarily exist on the 
transformation constructs themselves. Operable linkage can also 

10 result as a consequence of the insertion into the nuclear or 

plastidic DNA, where the regulatory elements are already present 
in the nuclear or plastidic DNA. in this respect, the regulatory 
elements can be naturally present or else introduced in a 
preceding step, for example when introducing an artificial DSB 

15 recognition sequence. 

Direct linkage in the chemical sense is not necessarily required 
in this context. Genetic control sequences such as, for example, 
enhancer sequences, can also exert their function on the target' 

20 sequence from positions which are further away, or indeed from 
other DNA molecules. Preferred arrangements are those in which 
the nucleic acid sequence to be expressed - for example encoding 
a DSBI enzyme - is positioned behind a sequence which acts as 
promoter, so that the two sequences are bonded covalently with 

25 one another. Preferably, the distance between the promoter 

sequence and the nucleic acid sequence - for example encoding a 
DSBI enzyme - is less than 200 base pairs, especially preferably 
less than 100 base pairs, very especially preferably less than 50 
base pairs. 

30 

The skilled worker is familiar with a variety of routes to obtain 
one of the transformation constructs according to the invention, 
vectors comprising them or one of the expression cassettes. They 
can be prepared by means of customary recombination and cloning 

35 techniques as are described for example in Kaniatis T, Fritsch EF 
and Sambrook J, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) and in 
Silhavy TJ, Berman ML and Enquist LW, Experiments with Gene 
Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 

40 (1984) and in Ausubel FM et al., Current Protocols in Molecular 
Biology, Greene Publishing Assoc. and Wiley Interscience (1987). 
The direct fusion of a nucleic acid sequence which acts as 
promoter and a nucleotide sequence to be expressed - for example 
encoding a DSBI enzyme — is preferred. 
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The term "genetic control sequences" is to be understood in the 
broad sense and refers to all those sequences which influence the 
generation or the function of an expression cassette or 
transformation vector. Genetic control sequences ensure 
5 transcription and, if appropriate, translation in the nucleus (or 
cytoplasm) or plastids. Preferably, the expression cassettes 
according to the invention comprise a promoter 5' upstream of the 
respective nucleic acid sequence to be expressed and a terminator 
sequence as additional genetic control sequence 3' downstream, 
10 and, if appropriate, further customary regulatory elements, in 
each case in operable linkage with the nucleic acid sequence to 
be expressed. 

Genetic control sequences are described, for example, by 
15 "Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990)" or "Gruber and Crosby, in: 
Methods in Plant Molecular Biology and Biotechnolgy, CRC Press, 
Boca Raton, Florida, eds.: Glick and Thompson, Chapter 7, 89-108" 
and the references cited therein. 

20 

Examples of such control sequences are sequences to which the 
inductors or repressors bind and thus regulate the expression of 
nucleic acid. The natural regulation of these sequences may still 
be present before the actual structural genes, in addition to 

25 these novel control sequences or instead of these sequences, and, 
if appropriate, can have been genetically modified so that the 
natural regulation has been switched off and gene expression 
enhanced. However, the expression cassette can also be simpler in 
structure, that is to say no additional regulatory signals are 

30 inserted before the abovementioned genes and the natural promoter 
together with its regulation is not removed. Instead, the natural 
control sequence is mutated in such a way that regulation no 
longer takes place and gene expression is enhanced. These 
modified promoters can also be placed by themselves before the 

35 natural genes in order to increase the activity. 

Depending on the host organism or the starting organism described 
in greater detail hereinbelow, which is converted into a 
genetically modified or transgenic organism by the introduction 
40 of the expression cassettes or vectors, different control 
sequences are suitable. 

Promoters which are suitable for nuclear expression (for example 
of a viral/bacteriophage RNA polymerase or of a DSBI enzyme with 
45 plastidic transit peptide) are, in principle, all those which are 
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capable of governing the expression of genes r in particular 
foreign genes, in plants. 

Suitable promoters are those which make possible constitutive 
5 expression in plants (Benfey et al. (1989) EMBO J. 8:2195-2202). 
in particular, a plant promoter or a promoter derived from a 
plant virus is used by preference. Especially preferred is the 
promoter of the cauliflower mosaic virus 35S transcript (Franck 
et al. (1980) Cell 21:285-294; Odell et al. (1985) Nature 

10 313:810-812; Shewmaker et al. (1985) Virology 140:281-288; 

Gardner et al. 1986, Plant Mol. Biol. 6, 221-228) or the 19S CaMV 
promoter (US 5,352,605 and WO 84/02913). As is known, this 
promoter comprises different recognition sequences for 
transcriptional effectors which, in their totality, lead to 

15 largely permanent and constitutive expression of the gene 

introduced (Benfey et al. (1989) EMBO J 8:2195-2202). A further 
suitable constitutive promoter is the "Rubisco small subunit 
(SSU)" promoter (US 4,962,028). A further example of a suitable 
promoter is the leguminB promoter (GenBank Acc. No.: X03677). 

20 Examples of further preferred constitutive promoters are the 
Agrobacterium nopaline synthase promoter, the TR dual promoter, 
the Agrobacterium OCS (octopine synthase) promoter, the ubiquitin 
promoter (Holtorf S et al. (1995) Plant Mol Biol 29:637-649), 
the promoters of the vacuolar ATPase subunits, the FBPaseP 

25 promoter (WO 98/18940) or the promoter of a proline-rich protein 
from wheat (WO 91/13991). Other suitable constitutive promoters 
which are preferred for the purposes of the present invention are 
the Super promoter (Ni M et al. (1995) Plant J 7:661-676; 
US 5,955,646) and the nitrilase-1 promoter of the Arabidopsis 

30 nitl gene (GenBank Acc. No.: Y07648.2, nucleotides 2456 to 4340; 
Hillebrand H et al. (1998) Plant Mol Biol 36 (l):89-99; 
Hillebrand H et al. (1996) Gene 170(2) :197-200) . 

Promoters which are preferred are inducible promoters, especially 
35 preferably chemically inducible promoters (Aoyama T and Chua NH 

(1997) Plant J 11:605-612; Caddick MX et al. (1998) Nat. 
Biotechnol 16:177-180; Review: Gatz (1997) Annu Rev Plant Physiol 
Plant Mol Biol 48:89-108) by means of which expression can be 
controlled at a particular point in time. Examples which may be 

40 mentioned are the PRP1 promoter (Ward et al. (1993) Plant Mol 
Biol 22:361-366), a salicylic-acid-inducible promoter (WO 
95/19443), a benzenesulf onamide-inducible promoter 
(EP-A-0388186), a tetracyclin-inducible promoter (Gatz et al. 
(1992) Plant J 2:397-404), an abscisic-acid-inducible promoter 

45 (EP-A 335 528), an ethanol-inducible promoter (Salter MG et al. 

(1998) Plant J. 16:127-132), the heavy-metal-inducible 
metallothionein I promoter (Amini S et al. (1986) Mol Cell Biol 
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6:2305-2316), the steroid-indue ible MMTV LTR promoter (Izant JG 
et al. (1985) Science 229:345-352) and a cyclohexanone-inducible 
promoter (WO 93/21334). Especially preferred is the inducible 
expression of a PLS/DSBI enzyme fusion protein in the nucleus* 
5 inducible promoters also comprise those which are capable of 
regulation by certain repressor proteins (for example tet, lac). 
Such repressor proteins can be translocated into the plastids in 
fusion with PLS, where they regulate the expression of certain 
genes under the control of suitable promoters. In the plastids, 

10 the repressor binds to an artificial repressor binding site which 
has been introduced into the plastome and can thus repress the 
expression of the downstream gene (cf. WO 95/25787). In this 
manner it is possible, for example, to induce the expression of a 
plast id-encoded DSBI enzyme when required, or to repress it until 

15 the point in time at which expression is desired. 

Other promoters which are preferred are those which are induced 
by biotic or abiotic stress such as, for example, the 
pathogen-inducible promoter of the PRP1 gene (Ward et al., 
20 Plant Mol Biol 1993, 22: 361-366), the heat-inducible tomato 
hsp70 promoter or hsp80 promoter (US 5,187,267), the 
chill-inducible potato alpha-amylase promoter (WO 96/12814) or 
the wounding- induced pinll promoter (EP-A 0 375 091). 

25 In an especially preferred embodiment, the nucleic acid which 
encodes the DSBI enzyme is, above all, expressed under the 
control of an inducible promoter. A controlled expression capable 
of being governed is thus obtained, and any problems caused by 
expressing a DSBI enzyme constitutively are avoided. 

30 

Advantageous control sequences for the expression cassettes or 
vectors according to the invention comprise viral, bacteriophage 
or bacterial promoters such as cos, tac, trp, tet, phoA, tat, 
lpp, lac, laclq, T7, T5, T3, gal, trc r ara, SP6, X-PR or X-PL 
35 promoter. They are preferably employed in combination with the 
expression of the respective, corresponding RNA polymerase. 

The expression in plastids can be effected using plastid 
promoters and/or transcription regulation elements. Examples 

40 which may be mentioned, but not by way of limitation, are the RNA 
polymerase promoter (WO 97/06250) or the promoters described in 
WO 00/07431, US 5,877,402, WO 97/06250, WO 98/55595, WO 99/46394, 
WO 01/42441 and WO 01/07590. The rpo B promoter element, the atpB 
promoter element, the clpP promoter element (see also WO 

45 99/46394) or the 16S rDNA promoter element should be mentioned. 
In this context, the promoter can also have a polycistronic 
"operon" assigned to it (EP-A 1 076 095; WO 00/20611). Systems in 
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which a nonplant (for example viral) rna polymerase is imported 
into the plastid using plastidic transit peptides and 
specifically induces, in the plastid, the expression transgenic 
sequences which are under the control of the RNA polymerase 
5 recognition sequences and have previously been inserted into the 
plastxdic DNA have also been described (WO 95/16783- US 
5,925,806; US 5,575,198). 

in addition to the abovementioned promoters, the following can 
10 also be preferably used: 3 

a) the PrbcL promoter (SEQ ID NO: 44) 

b) the Prpsl6 promoter (SEQ id NO: 50) 

15 

c) the Prrnl6 promoter (SEQ id no: 46) 

In an especially preferred embodiment, NEP promoters are 
employed. These are promoters which are functional in plastids 
20 and are recognized by the nuclear-encoded plastidic RNA 
polymerases (NEP). The following are preferred: Prrn-62; 
Pycf2-1577; PatpB-289; Prps2-152; Prpsl6-107; Pycfl-41; 
Patpl-207; PclpP-511; PclpP-173 and PaccD-129 (WO 97/06250- 
Hajdukiewicz PTJ et al. (1997) EMBO J 16:4041-4048). 

25 

The following are especially preferred: 

a) the PaccD-129 promoter of the tobacco accD gene (WO 97/06250- 
SEQ ID NO: 47) ' 

30 

b) the PclpP-53 promoter of the clpP gene as highly active NEP 
promoter in chloroplasts (WO 97/06250; SEQ ID NO: 48) 

c) the Prrn-62 promoter of the rrn gene (SEQ id NO: 49) 

d) the Prpsl6-107 promoter of the rpsl6 gene (SEQ id NO: 45) 

e) the PatpB/E-290 promoter of the tobacco atpB/E gene (Kapoor S 
et al. (1997) Plant J 11:327-337) (SEQ ID NO: 51) 

40 

f) the PrpoB-345 promoter of the rpoB gene (Liere K & Maliga P 
(1999) EMBO J 18: 249-257) (SEQ ID NO: 52) 

In general, all those promoters which belong to class III 
45 (Hajdukiewicz PTJ et al. (1997) EMBO J 16:4041-4048) and all 
fragments of the class II promoters which control the initiation 
of transcription by NEP can be utilized in this preferred 
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embodiment* Such promoters or promoter moieties are not 
particularly highly conserved. ATAGAATAAA is given as consensus 
near the transcription initiation site of NEP promoters 
(Hajdukiewicz PTJ et al (1997) EMBO J 16:4041-4048). 

5 

Normally, genes are surrounded by regulatory sequences which 
originate from the plastids of the organism to be transformed. 
Thus, sequence duplications which can lead to instabilities owing 
to spontaneous, intrachromosomal homologous recombination events 

10 are generated (Heifetz PB (2000) Biochimie 82(6-7) : 655-666 ) . To 
overcome this problem, it has been proposed to utilize 
heterologous regulatory sequences or to exploit regulatory unit6 
which already exist endogenous ly in the plastidic genome (WO 
99/46394; WO 01/42441). A reduction of the homology by 

15 mutagenesis of the endogenous promoter sequence has also been 
described (WO 01/07590). 

In principle, all natural promoters together with their 
regulatory sequences, such as those mentioned above, can be used 

20 for the method according to the invention. Especially preferred 
promoters are those which have been isolated from prokaryotes • 
Very especially preferred are promoters isolated from 
Synechocystis or E.coli. In addition, synthetic promoters such 
as, for example, a synthetic promoter derived from the E. coli 

25 consensus sequence for a70 promoters 

5 9 -TTGACA Ni 6 .i9 TATAAT N 3 CAT —3 ' , 

where N represents any nucleotide (that is to say A, G, C or T) 
30 can additionally also be used advantageously. It is obvious to 
the skilled worker that individual or few base substitutions in 
the conserved regions stated are also possible without destroying 
the function of the promoter. The variable design of these 
synthetic promoters by using a variety of sequential sequences 
35 makes it possible to generate a multiplicity of promoters which 
lack extensive homologies, which increases the stability of the 
expression cassettes in the plastome in particular in the event 
that several promoters are required. The following, particularly 
preferred promoter sequences, which are derived from the 
40 abovementioned consensus sequence, may be mentioned by way of 
example, but not by limitation: 

a) 5 ' -TTGACATTCACTCTTCAATTATCTATAATGATACA-3 ' (SEQ ID NO: 53) 

b) 5 ' -TTGACAATTTTCCTCTGAATTATATAATTAACAT-3 9 (SEQ ID NO: 72) 

45 
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It is obvious to the skilled worker that these synthetic 
promoters can control the expression of any genes. For example, 
they can be utilized for driving the expression of a selection 
marker, also in order to be able to select under regenerative 
5 conditions for transplastomic plants with the aid of said 
selection system. Examples of selection markers are enumerated 
further below. In addition, such synthetic promoters can be 
linked with any gene, for example with genes encoding antibodies, 
antigens or enzymes. Preferably, the expression cassettes 
10 consisting of such promoters also comprise 5 '-untranslated 

regions (or ribosome binding sites) or 3'-noncoding regions which 
are detailed hereinbelow. 



The invention furthermore relates to expression cassettes 
15 comprising a nucleic acid sequence encoding a DSBI enzyme under 
the control of a promoter which is functional in plant plastids, 
for example one of the above-described promoters. The expression 
cassette can comprise further elements such as, for example, 
transcription terminators and/or selection markers. 

20 

Genetic control sequences also comprise further promoters, 
promoter elements or minimal promoters which are capable of 
modifying the expression-controlling properties. Genetic control 
sequences furthermore also comprise the 5 '-untranslated region 

25 (5'-UTR) or the noncoding 3' region (3'-UTR) of genes <Eibl C 
(1999) Plant J 19: 1-13). it has been demonstrated that these cai 
exert significant functions in regulating the gene expression in 
plastids of higher plants. In the nucleus, too, genetic control 
elements such as 5'-UTR, introns or 3'-utr, can exert a function 

30 in gene expression. Thus, for example, it has been demonstrated 
that 5 '-untranslated sequences can enhance the transient 
expression of heterologous genes. They can furthermore promote 
tissue specificity (Rouster J et al., Plant J. 1998, 15: 
435-440.). 

35 

5'-0TRs and 3'-UTRs which are preferably employed in plastids 
are: 



a) 5'psbA (from tobacco) (SEQ ID NO: 54) 

40 

b) 5'rbcL including 5' portions from the coding region of the 
rbcL gene (from tobacco) (SEQ ID NO: 55); the sequence 
described as SEQ id NO: 55 has been mutated in comparison 
with the native sequence in order to introduce a Pagl and , 

45 Ncol cleavage site. 
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c) 5'rbcLs (SEQ ID NO: 56); the sequence described by SEQ ID NO: 
56 has been mutated in comparison with the native sequence in 
order to introduce a Pagl cleavage site. 

5 d) 3'psbA-l from Synechocystis (SEQ ID NO: 57) 

e) 3'psbA from tobacco (SEQ ID NO: 58) 

f) 3'rbcL from tobacco (SEQ ID NO: 59) 

10 

Genetic control sequences, especially for expression in plastids, 
also comprise in particular ribosome binding sequences for 
initiating translation. They are usually present in the S'-UTRs. 
This is especially preferred when suitable sequences are not 

15 provided by the nucleic acid sequence to be expressed or when 
such sequences are compatible with the expression system. 
Especially preferred is the use of a synthetic ribosome binding 
site (RBS) with the sequence 5 ' -GGAGG(N)3_iqATG-3 ' , preferably 
5'-GGAGG(N) 5 ATG-3' (SEQ ID NO: 60), particularly preferably 

20 5 ' -GGAGGATCTCATG-3 ' (SEQ ID NO: 61). 

The expression cassette can advantageously comprise one or more 
what are known as enhancer sequences in operable linkage with the 
promoter; these enhancer sequences make possible an enhanced 

25 transgenic expression of the nucleic acid sequence. Additional 
advantageous sequences, such as further regulatory elements or 
terminators, may also be inserted at the 3' end of the nucleic 
acid sequences to be expressed recombinant ly. One or more copies 
of the nucleic acid sequences to be expressed recombinantly may 

30 be present in the gene construct. 

It is furthermore possible to insert, after the start codon, what 
is known as a downstream box, which enhances expression in 
general (translation enhancer WO 00/07431; WO 01/21782). 

35 

Polyadenylation signals which are suitable as genetic control 
sequences, above all in the transformation of the nucleus, are 
plant polyadenylation signals, preferably those which correspond 
essentially to T-DNA polyadenylation signals from AgrroJbacterium 
40 tumefaciens, in particular of gene 3 of the T-DNA (octopine 
synthase) of the Ti plasmid pTiACHS (Gielen et al., EMBO J. 3 
(1984), 835 et seq.) or functional equivalents thereof. Examples 
of especially suitable terminator sequences are the OCS (octopine 
synthase) terminator and the NOS (nopaline synthase) terminator. 

45 
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The transformation vectors and insertion sequences according to 
the invention may comprise further nucleic acid sequences. Such 
nucleic acid sequences can preferably constitute expression 
cassettes. The following may be mentioned by way of example of 
5 the DNA sequences to be expressed in the expression constructs 
but not by limitation: ' 

1. Selection markers 

10 "Selection markers" means all those nucleic acid or protein 
sequences whose expression (i.e. transcription and, if 
appropriate, translation) confers a phenotype to a cell, tissue 
or organism which differs from that of an untrans formed cell 
tissue or organism. Selection markers comprises for example those 

15 nucleic acids or protein sequences whose expression confers an 
advantage (positiver selection marker) or disadvantage (negative 
selection marker) on a cell, tissue or organism in comparison 
with cells which do not express this nucleic acid or protein. For 
example, positive selection markers act by detoxifying a 

20 substance which has an inhibitory effect on the cell (for example 
resistance to antibiotics /herbicides) , or by forming a substance 
which makes possible improved regeneration or enhanced growth of 
the plant under the selected conditions (for example nutritive 
markers, hormone-producing markers such as ipt; see hereinbelow) 

25 Another form of positive selection markers comprises mutated 

proteins or RNAs which are insensitive to a selective agent (for 
example 16S rRNA mutants, which are insensitive to 
spectinomycin) . Negative selection markers act for example by 
catalyzing the formation of a toxic substance in the transformed 

30 cells (for example the codA gene). Moreover, selection marker can 
also comprise reporter proteins as long as they are suitable for 
differentiating transformed from untransformed cells, tissues or 
organs (for example by coloration or another detectable 
phenotype) . 



35 



40 



The following selection markers may be mentioned by way of 
example, but not by limitation: 

1.1 Positive selection markers: 



The selectable marker introduced into the nucleus or the plastids 
together with the expression cassette confers resistance to a 
biocide (for example a herbicide such as phosphinothricin, 
glyphosate or bromoxynil), a metabolic inhibitor such as 
45 2-deoxyglucose-6-phosphate (WO 98/45456) or an antibiotic such 
as, for example, tetracyclins, ampicillin, kanamycin, G 418, 
neomycin, bleomycin or hygromycin, to the successfully 
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transformed cells. The selection marker permits the selection of 
the transformed cells from untransformed cells (McCormick et al., 
Plant Cell Reports 5 (1986), 81-84? Dix PJ & Kavanagh TA (1995) 
Euphytica 85: 29-34). 

5 

Especially preferred selection markers are those which confer 
resistance to herbicides. Selection markers which may be 
mentioned by way of example ares 

10 - DNA sequences which encode phosphinothricin 

acetyl transferases (PAT), which acetylate the free amino 
group of the glutamine synthase inhibitor phosphinothricin 
(PPT) and thus detoxify the PPT (de Block et al. 1987, EMBO 
J. 6, 2513-2518) (also referred to as Bialophos® resistance 

15 gene (bar)). The bar gene encoding a phosphinothricin 

acetyltransf erase (PAT) can be isolated for example from 
Streptomyces hygroscopicus or S. viridochromogenes. Such 
sequences are known to the skilled worker (from Streptomyces 
hygroscopicus GenBank Acc. No.: X17220 and X05822, from 

20 Streptomyces viridochromogenes GenBank Acc. No.: M 22827 and 

X65195; US 5,489,520). Synthetic genes are further described 
for example for expression in plastids. A synthetic PAT gene 
is described in Becker et al. (1994) The Plant J. 5:299-307. 
The genes confer resistance to the herbicide bialaphos or 

25 glufosinate and are widely used markers in transgenic plants 

(Vickers JE et al. (1996). Plant Mol Biol Reporter 
14:363-368: Thompson CJ et al. (1987) EMBO J 6:2519-2523). 

5 -Enolpyruvylshikimate-3 -phosphate synthase genes (EPSP 

30 synthase genes), which confer resistance to glyphosate 
(N-(phosphonomethyl)glycin) . The nonselective herbicide 
glyphosate has 5-enolpyruvyl-3-phosphoshikimate synthase 
(EPSPS) as molecular target. This enzyme has a key function 
in the biosynthesis of aromatic amino acids in microbes and 

35 plants, but not in mammals (Steinrucken HC et al. (1980) 

Biochem. Biophys. Res. Commun. 94:1207—1212; Levin JG and 
Sprinson DB (1964) J. Biol. Chem. 239: 1142-1150; Cole DJ 
(1985) Mode of action of glyphosate a literature analysis, 
p. 48-74. In: Grossbard E and Atkinson D (eds.). The 

40 herbicide glyphosate. Buttersworths , Boston.). 

Glyphosate-tolerant EPSPS variants are preferably used as 
selection markers (Padgette SR et al. (1996). New weed 
control opportunities: development of soybeans with a Roundup 
Ready™ gene. In: Herbicide Resistant Crops (Duke, S.O., ed.), 

45 pp. 53-84. CRC Press, Boca Raton , FL; Saroha MK and Malik VS 
(1998) J Plant Biochemistry and Biotechnology 7:65-72). The 
EPSPS gene of Agrobacterium sp. strain CP 4 has a natural 
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tolerance to glyphosate which can be transferred to 
corresponding transgenic plants. The CP4 EPSPS gene has been 
cloned from Agrobacteriura sp. strain CP4 (Padgette SR et al 
(1995) Crop Science 35(5) : 1451-1461 ) . Sequences 
of 5-enolpyrvylshikimate-3-phosphate synthases which are 
glyphosate-tolerant, such as, for example, those described 
in US 5,510,471; US 5,776,760; US 5,864,425; US 5,633,435- 
US 5,627,061, US 5,463,175; EP 0 218 571, are described in 
the patents and also deposited in GenBank. Further sequences 
are described under GenBank Accession X63374. The aroA gene 
(M10947 S. typhimurium aroA locus 

5-enolpyruvylshikimate-3-phosphate synthase (aroA protein) 
gene) is furthermore preferred. 

the gox gene (glyphosate oxidoreductase ) , which encodes the 
Glyphosat* degrading enzyme. GOX (for example the glyphosate 
oxidoreductase from Achromobacter sp.) catalyzes the cleavage 
of a c-N bond in glyphosate, which is thus converted into 
ananomethylphosphonic acid (AMPA) and glyoxylate. GOX can 
thereby confer resistance to glyphosate (Padgette SR et al 
(1996) J Nutr. 1996 Mar; 126(3) :702-16; Shah D et al. (1986) 
Science 233: 478-481). 

the deh gene (encoding a dehalogenase which inactives 
Dalapon®), (GenBank Acc. No.: AX022822, AX022820 and 
WO 99/27116) 

bxn genes, which encode bromoxynil-degrading nitrilase 
enzymes, for example the Klebsiella ozanenae nitrilase. 
Sequences can be found in GenBank for example under the Acc 
No: E01313 (DNA encoding bromoxynil specific nitrilase) and* 
J03196 (K. pneumoniae bromoxynil-specific nitrilase (bxn) 
gene, complete cds). 

Neomycin phosphotransferases confer resistance to antibiotics 
(aminoglycosides) such as neomycin, G418, hygromycin, 
paromomycin or kanamycin, by reducing their inhibitory action 
by means of a phosphorylation reaction. Especially preferred 
is the nptll gene. Sequences can be obtained from GenBank 
(AF080390 minitransposon mTn5-GNm; AF080389 minitransposon 
mTn5-Nm, complete sequence). Moreover, the gene is already a 
component in a large number of expression vectors and can be 
isolated from them using methods with which the skilled 
worker is familiar (such as, for example, polymerase chain 
reaction) (AF234316 pCAMBIA-2301; AF234315 pCAMBIA-2300, 
AF234314 pCAMBiA-2201) . The nptii gene encodes an 
aminoglycoside 3'-0-phosphotransferase from E.coli, Tn5 
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(GenBank Acc. No: U00004 position 1401-2300; Beck et al. 
(1982) Gene 19 327-336). Moreover, the Acinetobacter bauman- 
nii aphA-6 gene, which encodes an aminoglycoside phospho- 
transferase, may also be utilized as selection marker (Huang 
5 et al. (2002) Mol Genet Genomics 268:19-27) 

the DOG 11 ! gene. The gene DOG*l was isolated from the yeast 
Saccharomyces cerevisiae (EP 0 807 836). It encodes a 
2-deoxyglucose-6-phosphate phosphatase, which confers 
10 resistance to 2 -DOG (Randez-Gil et al. 1995, Yeast 11, 

1233-1240; Sanz et al. (1994) Yeast 10:1195-1202, sequence: 
GenBank Acc. No.; NC001140 chromosome VIII, Saccharomyces 
cervisiae position 194799-194056). 

15 - Sulfonylurea- and imidazolinone-inactivating acetolactate 
synthases, which confer resistance to 

imidazolinone/sulfonylurea herbicides. Examples which may be 
mentioned of imidazolinone herbicides are the active 
substances imazamethabenz -methyl, imazamox, imazapyr, 

20 imazaquin and imazethapyr. Examples of sulfonylurea 

herbicides which may be mentioned are amidosulforon, 
azimsulfuron, chlorimuronethyl, chlorsulfuron, cinosulfuron, 
imazosulfuron, oxasulfuron, prosulfuron, rimsulfuron, 
sulfosulfuron. The skilled worker is familiar with a large 

25 number of further active substances from the abovementioned 

classes. Nucleic acid sequences such as, for example, the 
sequence for the Arabidopsis thaliana Csr 1.2 Gen (EC 
4.1.3.18) which has been deposited under the GenBank Acc No.: 
X51514, are suitable (Sathasivan K et al. (1990) Nucleic 

30 Acids Res. 18(8):2188). Acetolactate synthases, which confer 

resistance to imidazolinone herbicides, are furthermore 
described under the GenBank Acc. Nos.: 

a) ABO 4 9 82 3 Oryza sativa ALS mRNA for acetolactate synthase, 
35 complete cds, herbicide resistant biotype 

b) AF094326 Bassia scoparia herbicide resistant acetolactate 
synthase precursor (ALS) gene, complete cds 

40 c) X07645 Tobacco acetolactate synthase gene, ALS SuRB 
(EC 4.1.3.18) 

d) X07644 Tobacco acetolactate synthase gene, ALS SuRA 
(EC 4.1.3.18) 



45 



e) A19547 Synthetic nucleotide mutant acetolactate synthase 
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5 



f ) 


A19546 


Synthetic nucleotide mutant acetolactate 


synthase 


g) 


A19545 


Synthetic nucleotide mutant acetolactate 


synthase 


h) 


105376 


Sequence 5 from patent EP 0257993 




i) 


105373 


Sequence 2 from patent EP 0257993 




j) 


AL133315 





Hygromycin phosphotransferases (X74325 P. pseudomallei gene 
for hygromycin phosphotransferase) which confer resistance to 
the antibiotic hygromycin. The gene is a component of a large 
number of expression vectors and can be isolated from them 
15 using methods with which the skilled worker is familiar (such 

as, for example, polymerase chain reaction) (AF294981 
PINDEX4; AF234301 pCAMBIA-1380; AF234300 pCAMBIA-1304 ; 
AF234299 pCAMBIA-1303; AF234298 pCAMBIA-1302 ; AF354046 
pCAMBIA-1305.; AF354045 pCAMBIA-1305 . 1 ) 

20 

genes for resistance to 



a) chloramphenicol (chloramphenicol acetyltransf erase) , 

25 b) tetracyclic various resistance genes have been 

described, for example X65876 S. ordonez genes class D 
tetA and tetR for tetracyclin resistance and repressor 
proteins X51366 Bacillus cereus plasmid pBC16 tetracyclin 
resistance gene. Moreover, the gene is already a 

30 component of a large number of expression vectors and can 

be isolated therefrom using methods known to the skilled 
worker (such as, for example polymerase chain reaction) 

c) Streptomycin; various resistance genes have been 

35 described, for example with the GenBank Acc. No.:AJ278607 

Corynebacterium acetoacidophilum ant gene for 
streptomycin adenylyltransf erase . 

d) Zeocin; the corresponding resistance gene is a component 
40 of a large number of cloning vectors (for example L36849 

cloning vector pZEO) and can be isolated from these using 
methods known to the skilled worker (such as, for 
example, polymerase chain reaction). 

45 e) Ampicillin (B-lactamase gene; Datta N, Richmond MH. 

(1966) Biochem J. 9B(l):204-9; Heffron F et al. (1975) 
J. Bacterid 122: 250-256; the Amp gene was first cloned 
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for generating the E. coli vector pBR322; Bolivar F et 
al. (1977) Gene 2:95—114). The sequence is a component of 
a large number of cloning vectors and can be isolated 
from them using methods known to the skilled worker (such 
5 as, for example, polymerase chain reaction). 

Genes such as the isopentenyl transferase from Agrobacterium 
tumefaciens (strain:P022) (Genbank Acc. No.: AB025109). The 
ipt gene is a key enzyme of cytokinin biosynthesis. Its 

10 overexpression facilitates the regeneration of plants (for 

example selection on cytokinin-f ree medium). The method for 
utilizing the ipt gene has been described (Ebinuma H et al. 
(2000) Proc Natl Acad Sci USA 94:2117-2121; Ebinuma, H et al. 
(2000) Selection of marker-free transgenic plants using the 

15 oncogenes (ipt, rol A, B, C) of Agrobacterium as selectable 

markers. In Molecular Biology of Woody Plants. Kluwer 
Academic Publishers ) . 

Various other positive selection markers which confer a 
20 growth-related advantage to the transformed plants over the 
nontrans formed plants, and methods for their use, have been 
described, inter alia, in EP-A 0 601 092, Examples which may be 
mentioned are (^-glucuronidase (in conjunction with, for example, 
cytokinin glucuronide) , mannose-6-phosphate isomerase (in 
25 conjunction with mannose), UDP-galactose 4-epimerase (in 
conjunction with, for example, galactose), with 
mannose-6-phosphate isomerase in conjunction with mannose being 
especially preferred. 

30 Preferred for a selection marker which is functional in plastids 
are, in particular, those which confer resistance to 
spectinomycin, streptomycin, kanamycin, lincomycin, gentamycin, 
hygromycin, methotrexate, bleomycin, phleomycin, blasticidin, 
sulfonamide, phosphinothricin, chlorsulfuron, bromoxynil, 

35 glyphosate, 2,4-D, atrazine, 4-methyltryptophan, nitrate, 

S-aminoethyl-L-cysteine, lysine/threonine, aminoethyl-cysteine or 
betaine aldehyde. Especially preferred are the genes aadA, nptll, 
BADH, FLAKE -S (a fusion of aadA and GFP, described by Khan MS & 
Maliga P, 1999 Nature Biotech 17: 910-915). 

40 

As selection marker which is functional in plastids, it is mainly 
the aadA gene which has been described (Svab Z and Maliga P 
(1993) Proc Natl Acad Sci USA 90:913-917). Also described are 
modified 16S rDNA, the nptll gene (kanamycin resistance) and the 
45 bar gene (phosphinothricin resistance). Owing to the preference 
given to the selection marker aadA, the latter is preferably 
recycled, i.e. deleted from the genome, or plastome, following 
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its use (Fischer N et al. (1996) Mol Gen Genet 251:373-380; 
Corneille S et al. (2001) Plant J 27:171-178), so that aadA can 
be reused as selection marker in further transformations of an 
already transplastomic plant. The betaine-aldehyde dehydrogenase 
5 (BADH) from spinach has been described as a further possible 
selection marker (Daniell H et al. (2001) Trends Plant Science 
6:237-239; Daniell H et al. (2001) Curr Genet 39:109-116; WO 
01/64023; WO 01/64024; WO 01/64850). Lethal agents such Is, for 
example, glyphosate, can also be utilized in connection with 
10 correspondingly detoxifying or resistant enzymes (WO 01/81605). 

Binding type markers may also be utilized. To utilize the DBS 
recognition sequence of the homing endonuclease I-Cpal in the 
gene of the 23s rRNA, which sequence is preferred as insertion 

15 site, at least the 3' end of the insertion sequence (preferably 
an artificial intron) is surrounded by homologous sequences of 
the target region. Thus, sequences of the 23S rDNA are 
incorporated into the transformation vector. Point mutations can 
be introduced at one position (position 2073 or 2074 of the 

20 tobacco 23S rRNA, sequence: AAAGACCCTATGAAG becomes sequence: 
SGAGACCCTATGAAG ) , which point mutations confer resistance to 
lincomycin to the ribosomes derived from a 23S rDNA which has 
been mutated thus (Cseplo A et al. (1988) Mol Gen Genet 
214:295-299). Further point mutations comprise those in the 

25 tobacco 16S rRNA which confer reistance to spectinomycin 
(mutation underlined) : 

a) 5 ' -GGAAGGTGAGGATG£-3 ' (A in native sequence) 
30 Other mutations confer resistance to streptomycin: 

b) 5 ' -GAATGAAACTA.-3 ' (C in native sequence) 
1.2 Negative selection markers 

35 

Negative selection markers make possible for example the 
selection of organisms with successfully deleted sequences which 
comprise the marker gene (Koprek T et al. (1999) The Plant 
Journal 19(6) :719-726) . For example, sequences which encode 
40 selection markers or DSBI enzymes can be deleted from the 
genome /plas tome after successful application of the method 
according to the invention. 

When carrying out a negative selection, for example a compound 
45 which otherwise has no disadvantageous effect on the plant is 
converted into a compound which is disadvantageous, for example 
owing to the negative selection marker introduced into the plant. 
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Genes which have a disadvantageous effect per se are furthermore 
also suitable, such as, for example, TK thymidine kinase (TK) and 
diphtheria toxin A fragment (DT-A), the codA gene product 
encoding a cytosine deaminase (Gleave AP et al- (1999) Plant Mol 
5 Biol 40(2) : 22 3-35; Perera RJ et al. (1993) Plant Mol Biol 23(4): 
793-799; Stougaard J (1993) Plant J 3:755-761), the cytochrome 
P450 gene (Koprek et al. (1999) Plant J. 16:719-726), genes 
encoding a haloalkane dehalogenase (Naested H (1999) Plant J* 
18:571-576), the iaaH gene (Sundaresan V et al. (1995) Genes & 
10 Development 9:1797-1810) or the tms2 gene (Fedoroff NV & Smith DL 
(1993) Plant J 3:273-289). 

The concentrations of the antibiotics, herbicides, biocides or 
toxins which are used in each case to carry out the selection 
15 must be adapted to the respective test conditions or organisms. 
Examples which may be mentioned in context with plants are: 
kanamycin (Km) 50 to 100 mg/1, hygromycin B 40 mg/1, phosphino- 
thricin (Ppt) 6 to 20 mg/1, spectinomycin (Spec) 15 to 500 mg/1. 

20 Furthermore, it is possible to express functional analogs of the 
abovementioned nucleic acids encoding selection markers. 
Functional analogs means, in the present context, all those 
sequences which have essentially the same function, i.e. which 
are capable of selecting transformed organisms. In this context, 

25 it is quite feasible that the functional analog differs with 
regard to other characteristics. For example, it can have a 
higher or lower activity, or else have further functionalities. 

Functional analogs means furthermore sequences which encode 
30 fusion proteins consisting of one of the preferred selection 
markers and another protein, for example a further preferred 
selection marker, one of the reporter proteins mentioned 
hereinbelow or a PLS. By way of example, a fusion of the GFP 
(green fluorescent protein) and the aadA gene may be mentioned 
35 (Sidorov VA et al. (1999) Plant J 19:209-216). 

2 . Reporter genes 

Reporter genes encode readily quantifiable proteins which, via 
40 their color or enzyme activity, allow an assessment of the 

transformation efficiency, the site or time of expression or the 
identification of transgenic plants. Very especially preferred in 
this context are genes encoding reporter proteins (see also 
Schenborn E , Groskreutz D. Mol Biotechnol. 1999; 13(l):29-44) 
45 such as 
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"green fluorescent protein" (GFP) {C hui WL et al., Curr 
Biol 1996, 6:325-330; Leffel SM et al., Biotechniques. 
23(5):912-8, 1997; Sheen et al. (1995) Plant Journal 
8(5):777-784; Haseloff et al. (1997) Proc Natl Acad Sci 
USA 94(6):2122-2127; Reichel et al. (1996) Proc Natl Acad 
Sci USA 93(12) :5888-5893; Tian et al. (1997) Plant Cell 
Rep 16:267-271; WO 97/41228). 

Chloramphenicol transferase, 

Lucif erase (Millar et al., Plant Mol Biol Rep 1992 
10:324-414; Ow et al. (1986) Science, 234:856-859); allows 
detection via bioluminescence . 

B-Galactosidase, encodes an enzyme for which a variety of 
chromogenic substrates are available. 

^-Glucuronidase (GUS) (Jefferson et al., EMB0 J. 1987, 6, 
3901-3907) or the uidA gene, which encodes an enzyme for'a 
variety of chromogenic substrates. 

R-Locus gene product: protein which regulates the production 
of anthocyanin pigments (red coloration) in plant tissue and 
thus makes possible the direct analysis of the promoter 
activity without addition of further auxiliary substances or 
chromogenic substrates (Dellaporta et al., in: Chromosome 
Structure and Function: Impact of New Concepts, 18th Stadler 
Genetics Symposium, 11:263-282, 1988). 

8-Lactamase (Sutcliffe (1978) Proc Natl Acad Sci USA 
75:3737-3741), enzyme for a variety of chromogenic substrates 
(for example PAD AC, a chromogenic cephalosporin). 

xylE gene product (Zukowsky et al. (1983) Proc Natl Acad Sci 
USA 80:1101-1105), catechol dioxygenase capable of converting 
chromogenic catechols. 

Alpha-amylase (Ikuta et al. (1990) Bio/technol. 8:241-242). 

Tyrosinase (Katz et al. (1983) J Gen Microbiol 
129:2703-2714), enzyme which oxidizes tyrosine to DOPA and 
dopaquinone, which subsequently form melanin, which can be 
detected readily. 
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Aequorin (Prasher et al. (1985) Biochem Biophys Res Commun 
126(3) :1259-1268), can be used in the calcium-sensitive 
bioluminescence detection. 

5 The selection marker, or the reporter gene, is preferably encoded 
on the transformation construct, especially preferably on the 
insertion sequence. However, it can also be encoded on an 
independent transformation construct which is introduced into the 
nucleus or the plastids of a plant cell in the form of a 
10 cotrans formation together with the transformation construct of 
interest • 

The trans formation vectors and insertion sequences according to 
the invention may comprise further functional elements- The 

15 concept of further functional elements is to be understood in the 
broad sense. Preferably, it is understood as meaning all those 
elements which influence the generation, multiplication, 
function, use or value of the insertion sequences, transformation 
constructs or transformation vectors used within the scope of the 

20 present invention. The following may be mentioned by way of 
example of further functional elements, but not by limitation: 

i. Replication origins (ORI) which make possible an 
amplification of the expression cassettes or vectors 

25 according to the invention in, for example, £• coli or else 

in plastids. Examples of £• coli ORIs which may be mentioned 
are the pBR322 ori, the P15A ori (Sambrook et al.: Molecular 
Cloning. A Laboratory Manual, 2nd ed. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989) or the colEl 

30 ORI, for example from pBLUESCRIPT. Plastidic ORIs are 

described in US 5,693,507, US 5,932,479 or WO 99/10513. 

ii. Multiple cloning regions (MCRs) permit and facilitate the 
insertion of one or more nucleic acid sequences. 

35 

iii. Sequences which make possible the homologous recombination or 
insertion into the genome or plastome of a host organism. 

iv. Elements, for example border sequences, which make possible 
40 the Agrobacterium-mediated transfer into plant cells for the 

transfer and integration into the plant genome, such as, for 
example, the right or left border of the T-DNA or the vir 
region . 

45 An insertion sequence or an expression construct for a DSBI 
enzyme can be inserted advantageously using vectors into which 
these constructs, or cassettes, are inserted. Vectors can be 
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plasmids, cosmids, phages, viruses, retroviruses or else 
agrobacteria, by way of example. 

in an advantageous embodiment, the expression cassette is 
5 inserted by means of plasmid vectors. Preferred vectors are those 
which make possible a stable integration of the expression 
cassette into the host genome or plastome. 

The generation of a transformed organism or a transformed cell 
10 requires introducing the DNA in question into the host cell in 

ravallable" fT£* " A of methods 

is available for this procedure, which is referred to as 

transformation (see also Keown et al. (1990, Methods in 

Enzymology 185:527-537). Thus, for example, the DNA can be 

15 introduced directly by microinjection, electroporation or by 

bombardment with DNA-coated microparticles. Also, the cell can be 
permeabilized chemically, for example with polyethylene glycol 
so that the DNA can penetrate the cell by diffusion 

,« ^ ansforn,ation can al *o be effected by fusion with other 

20 DNA-comprising units such as minicells, cells, lysosomes or 
liposomes. Others which must additionally be mentioned are 
transection using calcium phosphate, deae dextran or cationic 
lipids, transduction, infection, the incubation of dry embryos in 
DNA-comprising solution, sonication, and the transformation of 

25 intact cells or tissue by microinjection or macroinjection into 
tissue or embryos, or tissue electroporation, or the vacuum 
infiltration of seeds. The skilled worker is familiar with such 

llT, ° f injection or electroporation of DNA into 

plant cells, the plasmid used need not meet any particular 

30 requirements. Simple plasmids such as those from the pUC series 
can be used. If intact plants are to be regenerated from the 
transformed cells, the presence of an additional, selectable 
marker gene on the plasmid is useful. Methods for the 
regeneration of plants from plant tissues or plant cells have 

35 also been described. - 

There are several options for introducing DNA into the plastids. 
The only decisive aspect for the present invention is that the 
DNA is introduced into the plastids. However, the present 
40 invention is not limited to a specific method. Any method which 
permits the introduction of the DNA to be transformed into the 

IT a h±9her " Suitable - Tne ^able transformation 

of plastids is a method with which the skilled worker is 
familiar; it has been described for higher plants (inter alia by 
45 Svab Z and Maliga P (1993) Proc Natl Acad Sci USA 90(3, :913-917) 
For example, the methods are based on transformation by means of 
the particle gun and insertion into the plastidic genome by 
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homologous recombination under selection pressure. Further 
methods are described in US 5,877,402. In EP-A 0 251 654, the DNA 
is introduced by Agrobacterium tumefaciens (see De Block M et al. 
(1985) EMBO J 4:1367-1372; Venkateswarlu K and Nazar RN (1991) 
5 Bio/Technology 9:1103-1105). It has furthermore been demonstrated 
that DNA can be introduced into isolated chloroplasts by means of 
electroporation, thus bringing about transient expression (To KY 
et al. (1996) Plant J 10:737-743). Transformation by means of a 
direct DNA transfer into plastids of protoplasts, for example 

10 using PEG (polyethylene glycol) is preferred (Koop HU et al. 
(1996) Planta 199:193-201; Kofer W et al. (1998) In Vitro Cell 
Dev Biol Plant 34:303-309; Dix PJ and Kavanagh TA (1995) 
Euphytica. 85:29-34; EP-A 0 223 247). Most preferred are 
biolistic transformation methods. Here, the DNA to be transformed 

15 is applied to, for example gold or tungsten particles. These 

particles are subsequently accelerated towards the explant to be 
transformed (Dix PJ and Kavanagh TA (1995) Euphytica. 85:29-34; 
EP-A 0 223 247). Thereafter, transplastomic plants are 
regenerated under selection pressure on suitable medium in the 

20 manner with which the skilled worker is familiar. Such methods 
have been described (for example US 5,451,513; US 5,877,402; 
Svab 2 et al. (1990) Proc Natl Acad Sci USA 87:8526-8530; Svab Z 
and Maliga P (1993) Proc Natl Acad Sci USA 90:913-917). Moreover, 
the DNA can be introduced into the plastids by means of 

25 microinjection. A specific microinjection method has been 
described recently (Knoblauch M et al. (1999) Nature Biotech 
17:906-909; van Bel AJE et al. (2001) Curr Opin Biotechnol 
12:144-149). This method is particularly preferred for the 
present invention. It is also possible to introduce, by means of 

30 protoplast fusion, the plastids from one species into a different 
species, to transform them in the latter and subsequently to 
return them to the original species by protoplast fusion 
(WO 01/70939) . 

35 Besides these "direct'' transformation techniques, transformation 
can also be carried out by bacterial infection by means of 
Agrobacterium tumefaciens or Agrobacterium rhizogenes (Horsch RB 
(1986) Proc Natl Acad Sci USA 83 (8) : 257 1-2575; Fraley et al. 
(1983) Proc Natl Acad Sci USA 80:4803-4807; Bevans et al. (1983) 

40 Nature 304:184-187). The expression cassette for, for example/ 
the DSBI enzyme is preferably intergrated into specific plasmids, 
either into a shuttle, or intermediate, vector or into a binary 
vector. Binary vectors are preferably used. Binary vectors are 
capable of replication both in E. coli and in Agrobacterium and 

45 be transformed directly into Agrobacterium (Holsters et al. 
(1978) Mol Gen Genet 163:181-187). Various binary vectors are 
known; some of them are commercially available such as, for 
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example, P B1N19 (Clontech Laboratories, Inc. USA; Bevan et al 
(1984) Nucl Acids Res 12:8711). The selection marker gene permits 
the selectxon of transformed agrobacteria and is, for example 
the nptll gene, which confers resistance to kanamycin. The binary 
5 plasmid can be transferred into the agrobacterial strain for 
example by electroporation or other transformation methods (Mozo 
& Hooykaas 1991, Plant Mol. Biol. 16, 917-918). The plant 
explants are generally cocultured with the agrobacterial strain 
for two to three days. The agrobacterium which, in this case 

10 acts as the host organism, should already comprise a plasmid'with 
the vir region. Many Agrobacterium tumefaciens strains are 
capable of transferring genetic material, such as, for example 
the strains EHA101[pEHA101] (Hood EE et al. (1996) j Bacteriol' 
168(3):1291-1301), EHA105[ P EHA105J (Hood et al. (1993) Transgenic 

15 Research 2:208-218), LBA4404 [pAL4404 ] (Hoekema et al. (1983) 
Nature 303:179-181), C58Cl(pMP90) (Koncz and Schell (19 86) Mol 
Gen Genet 204:383-396) and C58Cl[pGV22601 (Deblaere et al. (1985) 
Nucl Acids Res 13: 4777-4788). 

20 To transfer the DNA to the plant cell, plant explants are 
cocultured with Agrobacterium tumefaciens or Agrobacterium 
rhizogenes. Starting from infected plant material (for example 
leaf, root or stem portions, but also protoplasts or suspensions 
of plant cells), intact plants can be regenerated using a 

25 suitable medium which may comprise, for example, antiobiotics or 
biocides for selecting transformed cells. A cotransformed 
selection marker permits the selection of transformed cells from 
untransformed cells (Mccormick et al. (1986) Plant Cell Reports 
5:81-84). The plants obtained can be bred, selfed and hybridized 

30 in the customary manner. Two or more generations should be grown 
to ensure that the genomic integration is stable and hereditary. 
The abovementioned methods are described in, for example, Jenes B 
et al. (1993) Techniques for Gene Transfer, in: Transgenic 
Plants, vol. 1, Engineering and Utilization, edited by Rung SD 

35 and Wu R, Academic Press, pp. 128-143 and in Potrykus (1991) 
Ann Rev Plant Physiol Plant Molec Biol 42:205-225). 

The Agrobacterium-mediated transformation is best suited to 
dicotyledonous plant cells, whereas the direct transformation 
40 techniques are suitable for any cell type. 

The Agrobacterium-mediated transformation is especially 
preferably employed for the transformation of the nucleus, while 
the direct transformation techniques are especially preferably 
45 employed for the transformation of plastids. 
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As Boon as a predominanly horootransplastomic plant cell has been 
generated by the method according to the invention, an intact 
plant can be obtained using methods with which the skilled worker 
is familiar. The starting material for this purpose is, for 
5 example, callus cultures- The development of shoot and root can 
be induced in the known manner in this as yet undifferentiated 
bioroass. The shoots obtained can be planted out and used for 
breeding . 

10 Deletion methods 

In the above-described methods according to the invention, it is 
advantageous, at various levels, to remove certain sequences 
which have previously been introduced (for example sequences for 

15 selection markers and/or DSBI enzymes) from the plastome or 
genome of the plant or cell. Thus, it is advantageous, but not 
necessarily required, to remove the selection marker which has 
been introduced for example during the insertion of a nonnatural 
DSB recognition sequence, from the master plant since the same 

20 selection marker can then be utilized in a subsequent - 
transformation (for example with the insertion sequence). 
Deletion is particularly advantageous since the selection marker 
is no longer necessarily required after the selection phase, and 
therefore superfluous. Moreover, deletion increases the consumer 

25 acceptance and is desirable for registration purposes. Moreover, 
the protein synthesis apparatus of the plastid is not 
unnecessarily burdened by the synthesis of the marker protein, 
which has potentially advantageous effects on the characteristics 
of the plant in question. 

30 

The skilled worker is familiar with a variety of methods for the 
directed deletion of sequences. One which should be mentioned by 
way of example, but not by limition, is the excision by means of 
recombinases . Various sequence-specific recombination systems 

35 have been described, such as the Cre/lox system of bacteriophage 
PI (Dale EC and Ow DW (1991) Proc Natl Acad Sci USA 
88:10558-10562; Russell SH et al. (1992) Mol Gen Genet 234: 
49-59? Osborne BI et al. (1995) Plant J. 7, 687-701), the yeast 
FLP/FRT system (Kilby NJ et al. (1995) Plant J 8:637-652; Lyznik 

40 LA et al. (1996) Nucleic Acids Res 24:3784-3789), the Gin 

recombinase of the phage Mu, the Pin recombinase from £• coli or 
the R/RS system of the plasmid pSRl (Onouchi H et al. (1995) Mol 
Gen Genet 247:653-660; Sugita K et al. (2000) Plant J 
22:461-469). These methods can be utilized not only for deleting 

45 DNA sequences from the nuclear genome, but also from the plastome 
(Corneille et al. (2001) Plant J 27: 171-178; Hajdukieicz et al. 
(2001) Plant J 27:161-170). Further recombinases which can be 
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employed are, for example, PhiC31 (Kuhstoss & Rao (1991) J Mol 
Biol 222:897-908), TP901 (Christiansen et al. (1996) J Bacteriol 
178:5164-5173), xisF from Anabaena (Ramaswamy et al. (1997) Mol 
Microbiol 23:1241-1249), integrase from phage PhiLC3 (Lillehaug 
5 et al. (1997) Gene 188:129-136) or the recombinase encoded by 
the sre gene of the R4 phage (Matsuura et al. (1996) J Bacteriol 
178:3374-3376). 

In a preferred embodiment, however, the deletion is effected by 

10 intrachromosomal recombination owing to suitably introduced 
sequence duplications. The efficiency of the latter can be 
enhanced by the directed introduction of double-strand breaks 
near the sequence duplications (cf. Fig. 8). To this end, the 
sequence to be deleted is flanked bilaterally by homology 

15 sequences HI and H2 which have sufficient length and homology to 
undergo recombination with one another. Recombination is induced 
by the induction of at least one sequence-specific double-strand 
break of a further DSB recognition sequence located near one of 
the two homology sequences (but preferably different from the 

20 first one). This DSB recognition sequence is preferably localized 
between the two homology sequences. To induce the double-strand 
break, it is preferred to express or introduce a second DSBI 
enzyme which differs from the first one. This method is 
especially preferably utilized for deleting selection markers 

25 from the plastome. 

The invention furthermore relates to the transplastomic, 
predominantly homoplastomic, plants generated using the method 
according to the invention, and to parts of these plants, such as 
30 leaves, roots, seeds, fruits, tubers, pollen or cell cultures, 
callus and the like which are derived from such plants. 

The invention furthermore relates to the plants employed in the 
method according to the invention which comprise an expression 

35 cassette according to the invention for a DSBI enzyme or a fusion 
protein of PLS and DSBI enzyme, in this context, the expression 
cassette for the fusion protein of PLS and DSBI enzyme is - 
especially preferably - stably integrated in the nuclear DNA 
under the control of a promoter which is functional in the plant 

40 nucleus. The expression cassette encoding a DSBI enzyme under the 
control of a promoter which is active in plant plastids is 
preferably stably integrated into the plastome. Comprised are 
furthermore parts of same such as leaves, roots, seeds, tubers, 
fruits, pollen or cell cultures, callus and the like which are 

45 derived from the abovementioned plants. 
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Genetically modified plants according to the invention which can 
be consumed by humans and animals can also be used as foods or 
feeds , for example directly or following processing known per se. 

5 The invention furthermore relates to the use of the 

above-described transplastomic, predominantly homoplastomic, 
plants according to the invention and of the cells, cell 
cultures, parts — such as, for example, the roots, leaves and the 
like in the case of transgenic plant organisms - and transgenic 
10 propagation material such as seeds or fruits which are derived 
from them for the production of foods or feeds, pharmaceuticals 
or fine chemicals. 

Fine chemicals refers to enzymes such as, for example, the 

15 industrial enzymes mentioned hereinbelow, vitamins such as, for 
example, tocopherols and tocotrienols (for example vitamin E) and 
vitamin B2, amino acids such as, for example, methionine, lysine 
or glutamate, carbohydrates such as, for example, starch, 
amylose, amylopectin or sucrose, fatty acids such as, for 

20 example, saturated, unsaturated and polyunsaturated fatty acids, 
natural and synthetic flavorings, aroma chemicals such as, for 
example linalool, menthol, borneone (camphor), pinene, limonene 
or geraniol, and colorants such as, for examle, retinoids (for 
example vitamin A), flavonoids (for example quercetin, rutin, 

25 tangeretin, nobiletin) or carotenoids (for example P-carotene, 
lycopene, astaxanthin) . The production of tocopherols and 
tocotrienoles and of carotenoids is especially preferred. Growing 
the transformed host organisms, and isolation from the host 
organisms or the growth medium, are carried out using methods 

30 with which the skilled worker is familiar. The production of 

pharmaceuticals such as, for example, antibodies or vaccines has 
been described (Hood EE, Jilka JM. (1999) Curr Opin Biotechnol. 
10(4) : 382-386; Ma JK and Vine ND (1999) Curr Top Microbiol 
Immunol. 236 :275-92 ) . 

35 

The method according to the invention is particularly suitable 
for producing industrial enzymes within what is known as 
"phytof arming" . Examples of industrial enzymes which may be 
mentioned, but not by way of limitation, are lipases, esterases, 
40 proteases, nitrilases, acylases, epoxyhydrolases, amidases, 
phosphatases, xylanases, alcohol dehydrogenases, amylases, 
glucosidases , galactosidases , pullulanases , endocellulases , 
glucanases, cellulases, nucleases, chitin deacetylases, 
monoaminooxidases, lysozymes and laccases. 

45 
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Embodiments which are especially preferred for the purposes of 
the invention are described in greater detail hereinbelow within 
the explanations of the figures. witnin 

5 Sequences 



1. SEQ ZD NO: 1 

PCB42-94 Basic vector for plastid transformation. 

10 2. SEQ ID NO:2 

!r^?, a ^ d 8eguence in8 «ted into the multiple cloning site 
of PCB42-94 (SEQ ID NO: 1). Resulting vector: pCB199-3. 

3. SEQ ID NO: 3 

15 Nucleic acid sequence inserted into the multiple cloning site 
of pCB42-94 (SEQ ID NO: 1). Resulting vector: pCB401-20 

4. SEQ ID NO: 4 

Expession cassette from pCB289-13 for the expression of the 
20 I-Ppol homing endonuclease in plastids. 

5. SEQ ID NO: 5 

Amino acid sequence of the I-Ppol homing endonuclease encoded 
oy the expression cassette from pCB289-13. 

25 

6. SEQ ID N0:6 

PCB304 25 111 fra9ment empl °y ed for generating the vector 

30 7. SEQ ID NO: 7 

Nucleic acid sequence inserted into the multiple cloning site 
of pGEMTeasy. Resulting vector: pCB220-17 



35 



8. SEQ ID NO: 8 



Nucleic acid sequence inserted into the multiple cloning site 
of pBluescript. Resulting vector: pCB270-l 

9. SEQ ID NO: 9 

Sequence from vector P CB315-1: LacZ gene with inserted intron 
40 for detecting splicing. 

10. SEQ ID NO: 10 

Ll.LtrB intron from vector pCB345-34. 



45 
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11. SEQ ID NO: 11 

Synthetic sequence of the homing endonuclease I-Ppol (ORF: 16 
to 507) 

5 12. SEQ ID NO: 12 

Protein sequence of the homing endonuclease I-Ppol 

13. SEQ ID NO: 13 

Nucleic acid sequence of the homing endonuclease I-Cpal from 
10 Chlamydomonas pallidostigmatica (modification of the 

published sequence at position 69. An Ncol cleavage site was 
introdued at ATG (ORF: 4 to 462) 

14. SEQ ID NO: 14 

15 Protein sequence of the homing endonuclease I-Cpal 

15- SEQ ID NO: 15 

Sequence comprising the CpLSU2 intron 

20 16. SEQ ID NO: 16: Oligonucleotide primer pi 9 
5 ' -TAAGGCCC TCGGTAGC AAC GG- 3 ' 

17. SEQ ID NO: 17: Oligonucleotide primer p20 
5 ' -GGGGTACC AAATCC AACTAG- 3 ' 

25 

18. SEQ ID NO: 18: Oligonucleotide primer p21: 
5 ' -GGAGCTCGCTCCCCCGCCGTCGTTC-3 ' 

19. SEQ ID NO: 19: Oligonucleotide primer p22 
30 5 ' -GATGC ATG ATGACTTGACGGC ATCCTC - 3 ' 

20. SEQ ID NO: 20: Oligonucleotide primer pl90 
5 ' -GTCGACAGATCTTTAA- 3 ' 

35 21. SEQ ID NO: 21: Oligonucleotide primer pl91 
5 ' -AGATCTGTCGACTTAA-3 ' 

22. SEQ ID NO: 22: Oligonucleotide primer pl99 
5 ' - GATC TCC AGTTAACTGGGGTAC - 3 ' 

40 

23. SEQ ID NO: 23: Oligonucleotide primer p200 
5 ' -CCCAGTTAACTGGA- 3 ' 

24. SEQ ID NO: 24: Oligonucleotide primer p216 
45 5 ' -TTAAGCCAGTTAACTGGGCGGAGCT-3 ' 
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25. SEQ ID NO: 25: Oligonucleotide primer p219 
5 ' -CCGCCCAGTTAACTGGC-3 ' 

26. SEQ ID NO: 26: Oligonucleotide primer p276 
5 5 ' -TCGAGAAGATCAGCCTGTTATCCCTAGAGTAACT-3 ' 

27. SEQ ID NO: 27: Oligonucleotide primer p277 
5 ' -CTAGAGTTACTCTAGGGATAACAGGCTGATCTTC-3 ' 

10 28. SEQ ID NO: 28: Oligonucleotide primer p91 
5 ' -AGAAGACGATCCTAAGG-3 ' 

29. SEQ ID NO: 29: Oligonucleotide primer p92 
5 ' -TGAAGACTTGACAAGGAATTTCGC-3 ' 

15 

30. SEQ ID NO: 30: Oligonucleotide primer pl02 

5 ' -AGAAGACGATCCTAAATAGCAATATTTACCTTTG5S&C£AAAAGTTATCAGGCATG-3 

31. SEQ ID NO: 31: Oligonucleotide primer pl03 

20 5 ' TGAAGACTTGACAAGGAATTTCGCTACCTTCGAGTACTCCAAAACTAATC-3 ' 

32. SEQ ID NO: 32: Oligonucleotide primer p207 
5 '-GAGAAGACATTCCTAACACATCCATAACGTGCG-3 ' 

25 33. SEQ id NO: 33: Oligonucleotide primer p208 

5 ' -TGAAGACTTGACATTTGATATGGTGAAGTAGG-3 ' 



30 



34. SEQ ID NO: 34 

Nucleic acid sequence encoding the transit peptide of the 
small subunit (SSU) of ribulose bisphosphate carboxylase 
(Rubisco ssu ) from pea 



35. SEQ ID NO: 35 

Transit peptide of the small subunit (SSU) of ribulose 
35 bisphosphate carboxylase (Rubisco ssu) from pea 

36. SEQ ID NO: 36 

Transit peptide of the tobacco plastidic transketolase. 

40 37. SEQ ID NO: 37 

Nucleic acid sequence encoding the transit peptide of the 
tobacco plastidic transketolase (reading frame 1; pTP09) 

38. SEQ ID NO: 38 
45 Nucleic acid sequence encoding the transit peptide of the 
tobacco plastidic transketolase (reading frame 2; pTPIO) 
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39. SEQ ID NO: 39 

Nucleic acid sequence encoding the transit peptide of the 
tobacco plastidic transketolase (reading frame 3; pTPll) 

5 40. SEQ ID NO: 40 

Transit peptide of the plastidic isopentenyl-pyrophosphate 
isomerase-2 (IPP-2) from Arabidopsis thaliana. 

41. SEQ ID NO: 41 

10 Nucleic acid sequence encoding the transit peptide of the 

plastidic isopentenyl-pyrophosphate isomerase-2 (IPP-2) from 
Arabidopsis thaliana (reading frame 1? IPP-9) 

42. SEQ ID NO: 42 

15 Nucleic acid sequence encoding the transit peptide of the 

plastidic isopentenyl-pyrophosphate isomerase-2 (IPP-2) 
Arabidopsis thaliana (reading frame 2; IPP-10) 

43. SEQ ID NO: 43 

20 Nucleic acid sequence encoding the transit peptide of the 

plastidic isopentenyl-pyrophosphate isomerase-2 (IPP-2) from 
Arabidopsis thaliana (reading frame 3; IPP-11) 

44. SEQ ID NO: 44 

25 Nucleic acid sequence encoding the tobacco PrbcL promoter. 

45. SEQ ID NO: 45 

Nucleic acid sequence encoding the tobacco Prpsl6-107 
promoter . 

30 

46. SEQ ID NO: 46 

Nucleic acid sequence encoding the tobacco Prrnl6 promoter. 

47. SEQ ID NO: 47 

35 Nucleic acid sequence encoding the tobacco PaccD-129 
promoter. 

48. SEQ ID NO: 48 

Nucleic acid sequence encoding the tobacco PclpP-53 promoter. 

40 

49. SEQ ID NO: 49 

Nucleic acid sequence encoding the tobacco Prrn-62 promoter. 



50. 

45 



SEQ ID NO: 50 

Nucleic acid sequence encoding the tobacco Prpsl6 promoter. 
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51. SEQ ID NO: 51 

Nucleic acid sequence encoding the tobacco PatpB/E-290 
promoter . 

5 52, SEQ ID NO: 52 

Nucleic acid sequence encoding the tobacco PrpoB-345 
promoter . 

53. SEQ ID NO: 53 

10 Nucleic acid sequence encoding a promoter derived from the 

consensus sequence of the E . coli o70 promoters. 

54. SEQ ID NO; 54 

Nucleic acid sequence encoding the 5 '•untranslated region of 
15 the tobacco psbA gene (5'psbA) 

55. SEQ ID NO: 55 

Nucleic acid sequence encoding the 5 '-untranslated region 
including 5' portions from the coding region of the tobacco 
20 rbcL gene (5'rbcL). 

56. SEQ ID NO: 56 

Nucleic acid sequence encoding the 5 '-untranslated region of 
the tobacco rbcLs gene . 

25 

57. SEQ ID NO: 57 

Nucleic acid sequence encoding the 3 '-untranslated region of 
the Synechocystis psbA-1 gene (3'psbA-l) 

30 58. SEQ ID NO: 58 

Nucleic acid sequence encoding the 3 '-untranslated region of 
the tobacco psbA gene (3'psbA) 

59. SEQ ID NO: 59 

35 Nucleic acid sequence encoding the 3 ' -untranslated region of 

the tobacco rbcL gene (3'rbcL) 

60. SEQ ID NO: 60 

Nucleic acid sequence encoding synthetic ribosome binding 
40 sites (RBS) 9 

61. SEQ ID NO: 61 

Nucleic acid sequence encoding synthetic ribosome binding 
sites (RBS) y 

45 
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62- SEQ ID NO; 62 

Complete insert of the vector pCB3 04-25 

63. SEQ ID NO: 63 

5 Bglll / Muni fragment of the vector pCB320-192. 

64. SEQ ID. NO: 64: Oligonucleotide primer p93 
5 ' -AAAGATCTCCTCACAAAGGGGGTCG-3 ' 

10 65. SEQ ID NO: 65: Oligonucleotide primer p97 
5 ' -TCGAAGACTTAGGACCGTTATAG-3 ' 



66. SEQ ID NO: 66: Oligonucleotide primer p98 
5 ' -AGGAAGACCTTGTCGGGTAAGTTCCG-3 ' 

15 

67. SEQ ID NO: 67: Oligonucleotide primer p95: 
5 ' -CTCAATTGGGGTCTCTCTGTCCAGGTGCAGG-3 ' 

68. SEQ ID NO: 68: Nucleic acid sequence encoding fusion proteins 
20 from the native I-Ppo-I nuclease and the IPP plastid 

localization sequence (ORF for l-ppol: 181-672; IPP transit 
peptide: 1-180; native sequence from 1-172). 



69. SEQ ID NO: 69: Fusion proteins of the native I-Ppo-I nuclease 
25 and the IPP-plastid localization sequence. 

70. SEQ ID NO: 70: Nucleic acid sequence encoding long version of 
the I-Ppol homing endonuclease. 

30 71. SEQ ID NO: 71: Amino acid sequence encoding long version of 
the I-Ppol homing endonuclease. 

72. SEQ ID NO: 72: Nucleic acid sequence encoding a 

promoter sequence derived from the 
35 consensus sequence of the o70 promoters 

from E.coli. 

73. SEQ ID NO: 73: Nucleic acid sequence encoding the 

artificial intron TetIVS2a. 

40 

74. SEQ ID NO: 74: Insert of vector pCB459-l 



75. SEQ ID NO: 75: Insert of vector pCB478-3 



45 76. SEQ ID NO: 76: Insert of vector pCB492-25 
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77. SEQ ID NO: 77: Oligonucleotide primer P 396 

5 ' -TAGTAAATGACAATTTTCCTCTGAATTATATAATTAACATGGCGACTGTTTACCAAAA 
AC-3 

5 78. SEQ ID NO: 78: Oligonucleotide primer p95 

5 ' -CTCAATTGGGGTCTCTCTGTCCAGGTGCAGG-3 ' 

79. SEQ ID NO: 79: Nucleic acid sequence encoding PCR product 

Prom-TetIVS2a-Cpa 

10 

80. SEQ ID NO: 80: Insert of vector pCB435-45 

81. SEQ ID NO: 81: Nucleic acid sequence encoding probe for 

Southern blot analysis (directed against 
15 portions of the 16SrDNA) . 

82. SEQ id NO: 82: Nucleic acid sequence encoding probe for 

Southern blot analysis (directed against 
portions of the 23SrDNA) . 

20 

83. SEQ ID NO: 83: Insert of vector pCB456-2 

84. SEQ ID NO: 84: Insert of vector pCB528-2 from Kpnl to Sad 
25 Figures 

Within the method according to the invention, particularly the 
embodiments detailed in the figures hereinbelow are especially 
preferred. The following abbreviations are generally used in the 
30 figures: " """ 

A, A' Pair of homologous sequences A and A' 

A/A' Result of a homologous recombination between A and A' 
and/or a substitution of A by A' caused by repair 
35 synthesis. 

B, B' Pair of homologous sequences B and B' 

B/B' Result of a homologous recombination between B and B' 
and/or a substitution of B by B' caused by repair 
synthesis . 

40 HI, H2: Pair of homologous sequences HI and H2 

Hl/2: Sequence as the result of the homologous recombination of 
HI and H2 

DS Functional DSB recognition sequence 

nDS nonfunctional half of a DSB recognition sequence 

45 E: DSBI enzyme 

P : Promoter 

I: Further nucleic acid sequence (gene of interest) 
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S, S' Positive selection markers 
NS Negative selection marker 

IS \ Intron sequences. The intron in total is marked as a box. 

The box comprises all elements required for a functional 
5 intron. 

As already described above, A/A' and B/B' are the result of a 
homologous recombination and/or a substitution brought about by 
repair synthesis. The resulting sequence, in turn, can be the 
10 starting sequence for further homologous recombinations or repair 
syntheses. For the sake of simplicity, this sequence (A/ A' and 
B/B') is again referred to as A and B f respectively, in the steps 
which follow. 

15 1. Fig. 1: Introduction of a DSB recognition sequence into the 
plastome by means of double cross -over 

In an especially preferred embodiment l r a DSBR construct is 
first introduced into plastids of a higher plant. In this 

20 embodiment, the DSBR construct is preferably equipped with 
homologous target regions and with an expressible selection 
marker (promoter - 5'UTR — selection marker - 3'UTR) and, in this 
embodiment, preferably comprises a recognition region for a DSBI 
enzyme which preferably has no natural recognition sequence in 

25 the plastidic genome of the (untrans formed) plant in question. 
The DSBR construct can optionally already encode further genes of 
interest. Predominantly homoplastomic master plants are generated 
(Fig. 1). 

30 2. Fig . 2A-E: Introduction of an insert sequence with an expression 
cassette for a DSBI enzyme and, if appropriate, 
selection markers and further genes of interest 

Explants of the master plants generated in embodiment 1 are 
35 utilized for a further transformation with a transformation 
construct according to the invention. Preferably, the 
transformation construct according to the invention has regions 
which are homologous to the sequences surrounding the insertion 
site of the DSBR construct, which regions are preferably located 
40 on both sides (Fig. 2A, 2B) or on one side (Fig. 2C, 2D) of the 
insertion sequence. Insertion now takes place via homologous 
recombination (for example cross-over) or via repair synthesis. 

The sequences to be inserted are especially preferably — 
45 following the homology sequences inwardly — flanked by portions 
of the DSB recognition sequence (nDS) which correspond to the 
portions originating as the consequence of cleavage with the DSBI 
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enzyme (Fig. 2B). The insertion sequence thus comprises sequences 
which correspond in detail to the ends which are the result^ a 
cleavage in the plastome and thus ensure a particularly efficient 
incorporation. y erriclent 

If the transformation construct, or the insertion sequence, has 
no such homologous regions, the insertion sequence is preferably 
provided, at these ends, with overhangs which are also generated 

10 trig 2E> BI GnZyme ClGaVa9e ° f ^ maSt6r PlSnt PlaSt0rae 



If only one homology sequence is present, this sequence borders 
in an especially preferred embodiment, an nDS sequence (see 
hereinabove as described for Fig. 2B>, while the other side of 
15 the insertion sequence is provided with overhangs which 

correspond to those generated by the DSB1 enzyme in the plastome 
of the master plant (Fig. 2D). 

20 ^L^r rti °\ SeqU ! nCe ° ptionaU r c °des *>r a further expressible 
construe? fUnctionall r f "m that of the DSBR 

construct, if appropriate one or more expressible genes of 
interest and the expressible DSBI enzyme, which cleaves the 
recognition sequence introduced by the DSBR construct at the 
insert site in the plastidic genome of the master plant. The 

25 "sertion sequence of the transformation construct is inserted, 
in this context, in such a way at a position that said 
recognition region is no longer functional after the insertion. 

3. Fig. 3: Introduction of a DSB recognition sequence into the 
Plastome by means of double cross-over in a 
transcriptionally active region 

in a further, especially preferred embodiment 2, a DSBR construct 

35 e^d h intr ° dUCed int ° * lastids of a higher plant. ~£T 

35 embodiment, the DSBR construct is preferably equipped with 
homologous target regions and with an expressible selection 
marker, an endogenous promoter of the plastome being utilized, 

»\Zr feZablY additionall y comprises a recognition sequence 
for a DSBI enzyme which preferably has no natural recognition 
40 sequence in the plastidic genome of the (nontrans formed, plant in 
question. The DSBR construct may already encode genes of 
(F^t.* Pred ° minantly homoplastomic master plants are generated 

45 4. Fig.4A-E: Introduction of an isertion sequence with a cassette 
encoding a DSBI enzyme and, if appropriate, a 
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selection marker and further genes of interest 

ExplantB of the roaster plants generated in embodiment 2 are 
utilized for a further transformation with a transformation 
5 construct according to the invention. Preferably, the 

transformation construct according to the invention has regions 
which are homologous to the sequences surrounding the insertion 
site of the DSBR construct, which regions are preferably located 
on both sides (Fig, 4A, 4B) or on one side (Fig, 4C, 4D) of the 

10 insertion sequence. Insertion now takes place via homologous 
recombination (for example cross-over) or via repair synthesis. 
The sequences to be inserted are especially preferably — 
following the homology sequences inwardly — flanked by portions 
of the DSB recognition sequence (nDS) which correspond to the 

15 portions originating as the consequence of cleavage with the DSBI 
enzyme (Fig. 4B) . The insertion sequence thus comprises sequences 
which correspond in detail to the ends which are the result of a 
cleavage in the plastome and thus ensure a particularly efficient 
incorporation . 

20 

If the transformation construct, or the insertion sequence, has 
no such homologous regions, the insertion sequence is preferably 
provided, at these ends, with overhangs which are also generated 
by the DSBI enzyme after cleavage of the master plant plastome 
25 (Fig. 4E). 

If only one homology sequence is present, this sequence borders, 
in an especially preferred embodiment, an nDS sequence (see 
hereinabove as described for Fig. 4B), while the other side of 
30 the insertion sequence is provided with overhangs which 

correspond to those generated by the DSBI enzyme in the plastome 
of the master plant (Fig. 4D) . 

The insertion sequence optionally codes for a further expressible 
35 selection marker which differs functionally from that of the DSBR 
construct, if appropriate one or more expressible genes of 
interest and the expressible DSBI enzyme, which cleaves the 
recognition sequence introduced by the DSBR construct at the 
insert site in the plastidic genome of the master plant. The 
40 insertion sequence of the transformation construct is inserted, 
in this context, in such a way at a position that said 
recognition region is no longer functional after the insertion. 

5. Fig.5A-E: Introduction of an insertion sequence with a cassette 
45 encoding a DSBI enzyme and, if appropriate, selection 

markers and further genes of interest utilizing 
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natural, endogenous DSB recognition sequences 

In a further, very especially preferred embodiment 3, a 
transformation construct according to the invention comprises an 
5 expressible DSBI enzyme which has an endogenous, natural 

recognition sequence in the plastome of the plant in question. 

Explants of these natural master plants are utilized for a 
transformation with a transformation construct according to the 

10 invention. Preferably, the transformation construct according to 
the invention has regions which are homologous to the sequences 
surrounding the insertion site of the DSBR construct, which 
regions are preferably located on both sides (Fig. 5 A, 5B) or on 
one side (Fig. 5C, 5D) of the insertion sequence. Insertion now 

15 takes place via homologous recombination (for example cross-over 1 
or via repair synthesis. ' 

The sequences to be inserted are especially preferably - 
following the homology sequences inwardly _ flanked by portions 

20 of the DSB recognition sequence (nDS) which correspond to the 
portions originating as the consequence of cleavage with the dsbi 
enzyme (Fig. SB). The insertion sequence thus comprises sequences 
which correspond in detail to the ends which are the result of a 
cleavage in the plastome and thus ensure a particularly efficient 

25 incorporation. 1 eni 

If the transformation construct, or the insertion sequence, has 
no such homologous regions, the insertion sequence is preferably 
provided, at these ends, with overhangs which are also generated 

Tr^MT' enZymS aft6r CleaV39e ° f ^ mBSter ^ P 1 "^ 

If only one homology sequence is present, this sequence borders 
in an especially preferred embodiment, an nDS sequence (see ' 
35 hereinabove as described for Fig. SB), while the other side of 
the insertion sequence is provided with overhangs which 
correspond to those generated by the DSBI enzyme in the plastome 
of the master plant (Fig. 5D). 

40 The insertion sequence of the tranformation construct is, in this 
context, preferably inserted at a position in such a way that 
said recognition region is no longer functional after the 
insertion. The insertion sequence preferably encodes an 

45 anHh! ible ^f 011 " rtor < s ')' one or more genes of interest, 
45 and the expressible DSBI enzyme. The selection marker is 
optional. ~ " 
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6. Fig.6A-E: Introduction of an insertion sequence with a cassette 
encoding genes of interest and, if appropriate, 
selection markers, and introduction of a DSBI enzyme 
in trans 

5 

In further preferred embodiments 4, the DSBI enzyme is not 
encoded by the transformation construct, but either expressed in 
trans (in plastids or as PLS fusion protein in the nucleus) or 
trans fected into the plastids in the form of RNA or protein. The 

10 DSBI enzyme recognizes either an artificially introduced DSB 
recognition sequence (Fig. 6 A, 6B) or a natural DSB recognition 
sequence (Fig. 6C, 6D) . This embodiment is especially preferred 
when the transformation construct comprises no promoter elements , 
and expression of the coded genes is only realized after 

15 insertion into the plastome, using plastidic, endogenous 
promoters . 

As was the case in the embodiments which have already been 
described above, the transformation construct preferably has 
20 regions bilaterally (Fig. 6A, 6B) or unilaterally (not shown) of 
the insertion sequence which are homologous to the sequences 
surrounding the insertion site of the DSBR construct. Insertion 
now takes place via homologous recombination (for example 
cross-over) or repair synthesis. 

25 

The sequences to be inserted are especially preferably — 
following the homology sequences inwardly — flanked by portions 
of the DSB recognition sequence (nDS) which correspond to the 
portions originating as the consequence of cleavage with the DSBI 
30 enzyme (Fig. 6B, 6D). The insertion sequence thus comprises 
sequences which correspond in detail to the ends which are the 
result of a cleavage in the plastome and thus ensure a 
particularly efficient incorporation. 

35 If the transformation construct, or the insertion sequence, has 
no such homologous regions, the insertion sequence is preferably 
provided, at these ends, with overhangs which are also generated 
by the DSBI enzyme after cleavage of the master plant plastome 
(Fig. 6E). The transformation construct can additionally comprise 

40 a sequence encoding a DSBI enzyme. However, expression only takes 
place after successful insertion into the plastome, so that it is 
desirable that a first amount of functional RNA or protein of a 
DSBI enzyme is provided. 



45 If only one homology sequence is present, it borders, in an 
especially preferred embodiment, an nDS sequence (see above as 
described for Fig. 6B, 6D), while the other side of the insertion 
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eequence is provided with overhangs which correspond to those 

generated by the DSBI enzyme in the plastome of the master plant 
(not shown) . * 

5 in this context, the insertion sequence of the transformation 
construct is preferably inserted at a position in such a way that 
said recognition region is no longer functional after the 
insertion. 

10 7. Fig. 7A-Es Introduction of an insertion sequence comprising an 
mtron sequence with a cassette encoding genes of 
interest and, if appropriate, selection markers or 
DSBI enzymes 

15 in a further, very especially preferred embodiment 5, the gene of 
interest (and optionally a selection marker S' and/or the DSBI 
enzyme) is/are encoded within an intron which is functional at 
the insertion site selected, i.e. which can splice out of the 
transcript formed therein. 

20 

Preferably, the transformation construct according to the 
invention has regions which are homologous to the sequences 
surrounding the insertion site of the DSBR construct, which 
regions are preferably located on both sides (Fig. 7A, 7B) or one 
25 one side (not shown, of the insertion sequence. Insertion now 
takes place via homologous recombination (for example cross-over » 
or via repair synthesis. ' 

3Q lLTV T T 6fOImati ° n C ° nStruct or the insertion sequence has no 
30 such homologous regions, the insertion sequence is preferably 
provided, at these ends, with overhangs which are also generated 
by the DSBI enzyme after cleaving the plastome of the master 
plant (not shown) . 

35 Expression can be controlled by means of a promoter which is 
present on the transformation construct (Fig. 7B) or an 
endogenous, plastidic promoter (Fig. 7A) . in the first case, the 
DSBI enzyme is preferably present on the transformation construct 
Fig. 7B), while, in the latter case, it is either expressed^ 

40 least in parallel) in trans (in pl astids or as PLS fuLn proiein 
in the nucleus) or transfected into the plastids in the form of 
RNA or protein (Fig. 7A) . 

in this context, the insertion sequence of the transformation 
45 construct is preferably inserted at a position in such a way that 
said recognition region is no longer functional after the 
insertion. The transformation construct can additionally 
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optionally comprise a sequence encoding a DSBI enzyme. However , 
expression only takes place after successful insertion into the 
plastome, so that it is desirable that a first amount of 
functional RNA or protein of a DSBI enzyme is provided. 

5 

8. Fig. 8; Deletion of sequences by means of intramolecular 

homologous recombination induced by sequence-specific 
double -strand breaks 

10 In all of the above-described embodiments, sequences — for 

example those encoding selection markers or DSBI enzymes — are 
preferably flanked by homology sequences HI and H2 with 
sufficient length and homology to undergo recombination with one 
another. The recombination is induced by the induction of at 

15 least one double-strand break in the DSB recognition sequence 
located between the two homology sequences. To induce the 
double-strand break, it is preferred to transiently express or 
introduce a DSBI enzyme (Fig. 8). 

20 The skilled worker realizes that the sequence of the genes 

expressed in an operon is exchangeable and can thus vary in the 
above-described embodiments. Also, when using only one homology 
sequence for inserting the insertion sequence, this homology 
sequence may be localized at the 5' side (as shown in the 

25 figures) or the 3' side of the double-strand break, in principle, 
the DSBI enzyme can be expressed on the transformation construct 
and/or separately (in the nucleus or the plastids) and introduced 
differently into plastids, for example by transfection with RNA 
or protein. 

30 

9. Fig. 9: Southern analysis of predominantly homotransplastomic 

plants 

Wild-type and predominantly homotransplastomic master plants were 
35 analyzed with regard to the modification (introduction of a DSB 
recognition sequence? cf. Example 4). Owing to the modification, 
a 1750 bp band was detected (lanes 2, 3 and 4 corresponding to 
lines CB199NTH-4, -6 and -8), while a 3100 bp band was detected 
in the unmodified wild-type plant (lane 1), 

40 

10. Fig. 10: Modification of the IGS of the Tetrahymena LSU 

intr on . 

Capital letters indicate the sequence of the intron, while 
45 lower-case letters represent the sequence of the surrounding 

exons. The flanking exon sequences, the 5' and 3' portion of the 
intron or intron derivative, and the sequence comprising the ICS 
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are shown Bars between the bases indicate possible base pairings 
whxch can be forced for initiating the splicing procedure 

The abovementioned sequence segments of the naturally 
occurring Tetrahymena LSU intron in its natural exon 
environment are shown. 

The abovementioned sequence segments of the Tetrahymena LSU 
xntron derxvative generated within the scope of the present 
invention (TetIVS2a) in the above-defined exon environment, 
as is found xn the CpLSUS intron within the DSB recognition 
sequence of the DSBI enzyme I-Cpal are shown. Letters in bold 
represent the mutations carried out in comparison with the 
natural sequence. 



11. Fig. 11: 



20 



25 B: 



30 C 



iZ rlitT^l With BamHI - cut tota l ™ A ^om the tobacco 
lines CB255 + 435 N TH-16b, -16c, -19 and -20. A region of the 
16Sr ONA was used as probe. The bands representing plastome 
copies whxch correspond to the wild type ( WT; approx. 3 Txb 
band detected, and those which bear the transgene 
approx. 2.3 kb band detected) are identified by arrows. 

schematic representation of transplastomic tobacco plants 
whxch have originated by the insertion of the insertion 
sequence from pCB435-45; and the bands to be expected in a 
corresponding Southern analysis (cf. A). 

Southern analysis with Hindlli-cut total DNA from the tobacco 

23 X s n r S 0^ 5N r _16b ' - 16C ' - 20 ' A "9ion of the 

llti k t 38 Pr0be * The bandB "Presenting plastome 

copxes whxch correspond to the wild type (WT; approx. 1 1 kb 
band detected) and those which bear the transgene ( w. 
35 approx. 1.5 kb band detected) are identified by arrows. 

D: Schematic representation of transplastomic tobacco plants 
which have originated by the insertion of the insertion 
sequence from pCB255-l; and the bands to be expected in a 
40 corresponding Southern analysis (cf. C). 

12. Fig. 12s 

A: Wild-type and predominantly homotransplastomic master plants 
« were analyzed in a Southern analysis with regard to the 
modxfxcation (introduction of one of the I-Ppol osb 
recognition sequence; cf. Example 14.2). Owing to the 
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modification, an approximately 1.7 kb band was detected in 
the DNA which had been treated here with EcoRI (TG; lanes 1 
and 4 corresponding to lines CB456NTH-1 and -15) , while an 
approximately 3.1 kb band was detected in the unmodified 
5 wild-type plant (WT; lane 6). (wt - unmodified wild-type 
plant; wild type - shows the expected fragment size in 
unmodified wild-type plants; transgenic — shows the expected 
fragment size in plants CB456KTH) 

10 B: Schematic representation of the EcoRI fragment which was to 
be expected in A by hybridization with the probe in a 
modified plant CB456NTH. (trnV - gene encoding a tRNA-Val; 
rrnl6 — gene encoding the 16SrRNA; aadA — gene encoding a 
selection marker; 3'psbA (Synec) - noncoding region upstream 

15 of the Synechocystis psbA-1 gene, here incorporated into the 

expression cassette for the selection marker aadA; Psynth. — 
synthetic promoter derived from the consensus sequence for E, 
coli a70 promoters; DSB-R: DSB recognition sequence). 

20 Examples 

General methods: 

Oligonucleotides can be synthesized chemically for example in the 
25 known manner, using the phosphoamidite method (Voet, Voet, 2nd 
edition, Wiley Press New York, pages 896-897). The cloning steps 
carried out within the scope of the present invention such as, 
for example, restriction cleavages, agarose gel electrophoresis, 
purification of DNA fragments, transfer of nucleic acids to 
30 nitrocellulose and nylon membranes, linkage of DNA fragments, 
transformation of E . coli cells, bacterial cultures, propagation 
of phages, and the sequence analysis of recombinant DNA, are 
carried out as described by Sambrook et al. (1989) Cold Spring 
Harbor Laboratory Press; ISBN 0-87969-309-6, Recombinant DNA 
35 molecules are sequenced with an ALF- Express laser fluorescence 
DNA sequencer (Pharmacia, Uppsala, Sweden) following the method 
of Sanger (Sanger et al. (1977) Proc Natl Acad Sci USA 
74:5463-5467) . 

40 Example 1: Generating a basic vector for the transformation of 
plastids 

Initially, the selected target regions were cloned from the 
plastome of tobacco cv. SRI by means of PCR. The left-hand target 
45 region was amplified using the primers pl9 and p20. 
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pl9: 5 ' -TAAGGCCCTCGGTAGCAACGG-3 ' (SEQ ID NO: 16) 

p20: 5 ' -GGGGTACCAAATCCAACTAG-3 ' (SEQ ID NO: 17) 

5 The primers P 21 and P 22 were used for amplifying the right-hand 
target region, the last-mentioned primer additionally introducing 
a spectanomycin resistance into the amplfied part of the 16S 
rDNA, in addition to the SRI resistance (binding-type marker). 

10 p21: 5'-GGAGCTCGCTCCCCCGCCGTCGTTC-3' (SEQ ID NO: 18) 

p22: 5 ' -GATGCATGATGACTTGACGGCATCCTC-3 ' (SEQ ID NO: 19) 

The two amplified regions were cloned into pBluescript and 
15 pZeroBlunt, respectively, and sequenced. The left-hand and 
right-hand target regions were subsequently cloned into the 
backbone of the P UC19 vector. The cleavage sites BcoO109I and 
Pvull of the vector were used for this purpose. A multiple 
cloning site from pBluescript (from Kpnl to SacI) was cloned 
20 between the left-hand and right-hand target region. This multiple 
cloning site resides between the two plast id-encoded genes trnv 
and rrnl6 in the basic vector for the transformation of plastids 

!! VeCt ° r f ° r the ^sforaation of plastids was named ' 
PCB42-94 (SEQ ID NO: 1). The vector comprises the following 
25 sequence elements: 

a) Position complementary to bp 55-1405: right-hand target 
region with the partial gene of the 16S rRNA (complementary 
to bp 56 to 1322). The latter comprises mutations for the 

30 streptomycin resistance (SRI, position bp 346) and 
spectinomycin resistance (SPC1, position bp 68). 

b) Position complementary to bp 2374 to 1510: left-hand target 
region comprising 0RF131 (bp 1729 to 2124) and trnV gene 

35 inter alia (complementary to bp 1613 to 1542). 

c) Position bp 1404 to 1511: multiple cloning site 

M d) ! 08 ^ i0n bP 2 " 9 t0 3417 : resistance in the vector 

40 backbone 

Example 2: Generation of a vector (pCB199-3) for introducing a 
non-naturally occurring recognition region for the 
homin 9 endonucleases I-Ppol into the plastome of 
45 tobacco 
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Various elements were cloned one after the other into the 
multiple cloning site of the basic vector pCB42-94 (SEQ ID NO: 1) 
for the trans formation of plastids: 

5 a) frt recognition region ( mutated , contains no Xbal cleavage 
site; complementary to 1307-1354) 

b) expression cassette for expressing the marker gene aadA, 
consisting of: 

0 

i) promoter of the gene for 16S rRNA (complementary to 
1191-1281) 



ii) 5 '-untranslated region of the tobacco rbcL gene 
15 (complementary to 1167-1184) including mutated 5' 

portions of the rbcL gene (duplication of 6 AS of the 
rbcL gene, partly mutated, as the consequence of the 
cloning strategy so that a fusion encoding a total of 12 
amino acids (complementary to 1131-1166) with the 
20 subsequent element , the aadA gene, was formed) 

iii) aadA gene (complementary to 336-1130) 

iv) the 3' region of the psbA gene (complementary to 232-323) 

25 

c) core recognition region for the homing endonuclease I-Ppol 
(complementary to 176-190). 



This vector, which is referred to as pCB199-3, comprises the 
30 abovementioned elements within the nucleic acid sequence with the 
SEQ ID NO: 2 instead of the multiple cloning site in the basic 
vector for the transformation of plastids. The sequence which 
replaces the complete MCS from Kpnl to SacI is shown. However, 
there is no longer a Kpnl cleavage site in the sequence shown, 
35 owing the cloning strategy. 

Example 3: Generation of a further vector (pCB401-20) for 

introducing a non-naturally occurring recognition 
region for the homing endonuclease I-Ppol into the 
40 plastome of tobacco 



In contrast to the vector pCB199-3 described in Example 2, the 
vector described herein comprises no promoter and 3'UTR linked 
directly to the selection marker aadA. Rather, the expression of 
45 the aadA gene is controlled starting from the promoter of the 
trnV gene which is localized in the plastidic genome or in the 
left-hand target region upstream of the aadA gene. The purpose of 
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generating this vector was to avoid sequence duplication by 
exploiting regulatory regions from the tobacco plastidic genome 
To this end, various elements were cloned one after the other 
into the multiple cloning site of the basic vector pCB42-94 for 
5 the transformation of plastids: 

a) ribosome binding site (complementary to bp 1033 to 1050) 

b) aadA gene (complementary to bp 238 to 1032) 



10 



c) core recognition region for the homing endonuclease I- Ppo i 
(complementary to bp 176 to 190) 

The resulting vector also confers spectinomycin resistance in E 
15 coli. This vector, which is referred to as pCB401-20, comprises 
the abovementioned elements within the nucleic acid sequence with 
the SEQ ID NO: 3 instead of the multiple cloning site in the 
basic vector pCB42-94 (SEQ ID NO: 1) for the transformation of 
plastids. Again, all of the sequence which replaces the MCS (from 
20 Sacl to Kpnl) is shown. y 

Example 4: Generation of predominantly homoplastomic tobacco 
master plants comprising a non-natural DSB 
recognition sequence 

25 

The plasmid pCB199-3 was introduced into the plastids of tobacco 
(Nicotiana tabacum cv. Petit Havana) as described hereinbelow 
The regenerated plants were named CB199NTH. Independent lines 
were provided with different last numbers (for example 
30 CB199NTH-4). * 

The vector pCB401-20 is introduced analogously into the plastids 
of tobacco. Accordingly, the resulting plants are named CB401NTH. 

35 First, leaf disks of diameter 2.0 to 2.5 cm were punched out of 
m-vitro-cultured plants, using a sterile cork borer, and placed 
upside-down on a Petri dish with bombardment medium [MS salts 
if n^ _A i^ Ch) 4,3 9/1J aucroses 30 -° Phyto-agar (Duchefa, 

40 'I * ! W/V,; PH S ' 8; a£t6r aut ° cl * v "g, 1.0 mg/1 thiamine 

40 (Duchefa, T0614) and 0.1 g/l myo-inositol (Duchefa, 10609) were 
added). The underside of the leaf, which faced away from the 
agar, was subsequently bombarded using the particle gun. To this 
end, the plasmid DNA to be transformed (isolated from E. coli 
using Nucleobond AX100 Macherey & Nagel) was first applied to 

45 gold particles 0.6 p in side by the following protocol 

("coating",. First, 30 mg of powdered gold (BioRad) were taken up 
in ethanol. 60 ul of the gold suspension were transferred into a 
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fresh Eppendorf tube and the gold particles were sedimented by 
centrifugation (for 10 seconds). The gold particles were washed 
twice in in each case 200 (il of sterile water and, after a further 
centrifugation step r taken up in 55 jil of water. The following 
5 were added rapidly , with continuous mixing (vortexing): 

5 \xl plasmid DNA (1 ng/jil) 

50 nl 2.5 M CaCl 2 

20 (il 0.1 M spermidine 

10 

The suspension was subsequently vortexed for a further 3 minutes 
and subsequently centrifuged briefly. The go Id /DNA complexes 
which had sedimented were washed once or twice in in each case 
200 nl of ethanol and, after a further centrifugation step, 
15 finally taken up in 63 fil of ethanol* 3.5 jil (corresponding to 
100 \ig of gold) of this suspension were applied to a macrocarrier 
for each bombardment. 

The particle gun (BioRad, PDSlOOOHe) was prepared in accordance 

20 with the manufacturer's instructions, and the leaf explants were 
bombarded with the gold/ DNA complexes from a distance of 10 cm. 
The following parameters were used: vacuum: 27 inch Kg, pressure 
1100 pBi. After the bombardment, the explants were incubated for 
2 days in controlled-evironment cabinets (24°C, 16 h light, 8 h 

25 darkness) and subsequently divided into segments approximately 
0.5 cm 2 in size, using a surgical blade. These segments were then 
transferred to regeneration medium [bombardment medium 
supplemented with 1 mg/1 6-benzylaminopurine (BAP, Duchefa r 
B0904) and 0.1 mg/1 naphthylacetic acid (NAA, Duchefa, N0903)] 

30 supplemented with 500 mg/1 spectinomycin (Duchefa, 50188) and in- 
cubated for 10 to 14 days under the abovementioned conditions in 
a controlled-environment cabinet. After this period of time had 
elapsed, the leaf segments were transferred to fresh regeneration 
medium supplemented with 500 mg/1 spectinomycin. This procedure 

35 was repeated until green shoots formed on the explants. The 
shoots were removed using a surgical blade and grown on growth 
medium (like bombardment medium, but with 10 g/1 sucrose instead 
of 30 g/1 sucrose) supplemented with 500 mg/1 spectinomycin. 

40 to obtain as predominantly homoplastomic plants as possible, it 
is optionally possible to excise explants from the regenerated 
plants themselves and to place them on regeneration medium with 
1000 mg/1 spectinomycin. Regenerating shoots are transferred into 
glass containers with growth medium supplemented with 500 to 

45 
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1000 mg/i spectinomycin. After the plants have rooted, they are 
transferred into the greenhouse, where they are grown in soil 
until the seeds have matured. 

5 When the transformation was carried out with the plasmid 
pCB199-3, 8 plants with resistance to spectinomycin were 
obtained. PCR and Southern analyses proved that three of these 
lines (lines CB199NTB.-4, -6 and -8) have indeed incorporated the 
aadA gene into the plastidic genome. 

10 

To perform a Southern analysis of the transplastomic plants, 
total DNA from leaves was isolated from transformed and 
untrans formed plants with the aid of the GenElute Plant Genomic 
DNA Kit (Sigma). The DNA was taken up in 200 jil of eluate. 86 jxl 

15 portions of this were treated with in each case 10 ul of lOx 
restriction puffer and 40 u of restriction endonuclease and 
incubated for 4 to 8 hours at the temperature recommended for the 
restriction enzyme. The DNA was subsequently precipitated with 
ethanol in the manner known to the skilled worker and the 

20 precipitate was subsequently taken up in 20 ul of water. The 
samples were subsequently separated on agarose gel by methods 
known to the skilled worker, and the DNA was denatured in the gel 
and transferred to a nylon membrane by means of a capillary blot. 

25 A suitable probe for the radioactive hybridization was generated 
with the aid of the HighPrime (Roche) system. First, the membrane 
was prehybridized for 1 hour at 65°C with Hyb buffer (1% < w /v)) 
bovine serum albumin; 7% (w/v) SDS; 1 mM EDTA; 0.5 M sodium 
phosphate buffer, pH 7.2). The he at -denatured probe was 

30 subsequently added and left to hybridize overnight at 65'C. The 
blots were subsequently washed as follows: one rinse with * 
2 x SSPE/0.1% SDS; washing for 15 minutes at 65°C with 
2 x SSPE/0.1% SDS; washing for 15 minutes at 65°C with 
1 x SSPE/0.1% SDS; if appropriate, the last step was repeated 

35 again (20 x SSPE is 3 M NaCl; 0.2 M NaH 2 P0 4 ; 0.5 M EDTA; pH 7.4). 

The hybridization was subsequently analyzed with the aid of a 
phosphoimager (Molecular Imager FX, BioRad) . 

40 For example, Pstl-cut total DNA from different plants which had 
been regenerated after transformation with pCB199-3 were 
hybridized with the aadA gene as radiolabeled probe (793 bp 
Pstl/Ncol fragment from PCB199-3). Here, it was found that the 
lines CB199NTH-4, -6 and -8 had indeed incorporated the aadA gene 

45 into the DNA. Moreover, EcoRI- and Xhol-cut total DNA from 
CB199NTH-4, -6 and -8 was hybridized with a radiolabeled probe 
(1082 bp Bspl201/Sacl fragment from pCB199-3), which hybridizes 
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with part of the 16S rDNA. While , as expected , an approximately 
3100 bp band was detected in the wild type (untransformed plant), 
mostly a band at 1750 bp was detected in the transplastomic 
lines, as the result of the insertion of the insertion sequence 
5 from pCBl99-3 into the plastome (Fig. 9). The resulting plants 
can be considered as being being predominantly 
homotr ansplas tomic . 

Example 5: Generation of transformation vectors which can be used 
10 for transforming the plastids of master plants CB199NTH by means 
of the artificial homing process 

5.1 Cloning of the homing endonuclease I-Ppol 

15 The homing endonuclease I-Ppol was generated from 26 synthetic 
oligonucleotides by means of PCR, following a modification of 
the method of Stemmer WPC et al. (1995) Gene 164: 49-53 (SEQ ID 
NO: 11). The basic sequence was derived from the published 
sequence (Accession No. M38131 nucleotides 86 to 577). Here, a 

20 few mutations were introduced to remove restriction endonuklease 
recognition sites from the gene; however/ these mutations did not 
involve an altered amino acid sequence. The following elements 
were subsequently combined one after the other in a pBluescript 
KS (Stratagene) vector backbone in order to generate an I-Fpol 

25 expression cassette. The sequence is flanked by the cleavage 
sites Kpnl and Sad. 

a) Position 21 to 111: Prrn promoter 

30 b) Position 118 to 135: 5 ' -untranslated region of the rbcL 

gene followed by 18 bp encoding 6 amino acids of the rbcL 
protein (bp 136-152) 

c) Position 154 to 645: Nucleic acid sequence encoding I-Ppol. 

35 

d) Position 688-779: 3 '-untranslated region of the psbA gene. 

The resulting plasmid was named pCB289-13. Despite the expression 
of the enzyme I-PpoI r which was expected to take place in E. 
40 coli, no adverse effects on the growth were observed. The 
sequence described by SEQ ID NO: 4 resulted from the Kpnl 
cleavage site to the SacI cleavage site (vector backbone remains 
that of pBluescript KS). 

45 5.2 Generation of a transformation vector for artificial homing 
with homologous regions flanking the insertion sequence 
bilaterally 
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I) without I-Ppol in the insertion sequence 

Regions around the I-Ppol recognition region from pCB199-3 were 
excised using PstI and SacI and ligated into the PstI and Sad 
5 cleavage sites of pBluescript. Thereafter, cleavage sites which 
were not required were removed from this vector by linearizing it 
with PstI and Bspl20l and, following treatment with Klenow 
fragment, recircularizing the vector. With the aid of 
commercially available enzyme I-Ppol (PROMEGA GmbH, Mannheim 
10 Germany), the corresponding recognition region was cleaved in the 
resulting vector, and further cleavage sites were inserted 
therein by means of the synthetic oligos pl90 and P 19l. 



15 



Oligo pl90: 5 ' -GTCGACAGATCTTTAA-3 ' (SEQ ID NO: 20) 
Oligo pl91: 5 ' -AGATCTGTCGACTTAA— 3 ' (SEQ ID NO: 21) 



An expression cassette consisting of the following elements was 
introduced in the form of a Bglll / xhol fragment (SEQ ID NO: 6) 
20 into the cleavage sites Sail and Bglll, which had thus been 
introduced: 

a) Prpsl6 promoter (complementary to 1033-1139) 

25 b) 5'rbcL (complementary to 1007-1024) with 18 bp encoding the 
6 AAs (complementary to 989-1006) 

c) nptll gene (complementary to 185-988) 

30 d) 3'rbcL (complementary to 6-143) 

The resulting vector was named pCB304-25 and also conferred 
kanamycin resistance to E. coli cells. This vector is no longer 
linearized by commercially available I-Ppol. All of the insert of 
35 the vector pCB304-25 (backbone pBluescript; replacing the MCS 
accordingly from Sad to Kpnl) is described by SEQ id NO: 62 and 
thus comprises the following elements: 

a) Position bp 19 to 110: 3'psbA from tobacco 

40 

b) Position bp 149 to 160: nonfunctional "half of 

the I-Ppol recognition sequence 

c) Position bp 171 to 277: Prpsl6 promoter 

45 
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5 

e) 

f) Position bp 1167 to 1304: 
10 g) Position bp 1310 to 1319: 



96 

5'rbcL sequence followed by 18 bp 
encoding the first 6 amino 
acids of the rbcL protein 
(bp 304-321) 



3'rbcL 

nonfunctional "half" of 

the I-Ppol recognition sequence 



Position bp 322 to 1125: nptll 



II) With I-Ppol in the insertion sequence 



15 A Bglll / Muni fragment which encoded a 5'psbA — I-Ppol fusion 
was additionally introduced into the vector pCB304-25 with the 
aid of the BamHI and EcoRI cleavage sites ♦ The resulting vector 
pCB320-192 thus expressed the nptll gene and I-Ppol homing 
endonuclease under the control of the Prpsl6 promoter. The 

20 Bgl II / Mun I fragment is represented by SEQ ID NO: 63 and 
comprises the following elements: 



a) Position bp 6 to 82: 5'psbA 
25 b) Position bp 83 to 574: I-Ppol 



5.3 Generation of a transformation vector for artificial homing 
with a homologous region flanking the insertion sequence 
unilaterally 

30 

The elements located upstream of the Prpsl6 promoter and which 
are homologous to those in the master plants CB199NTH were 
removed from the vector pCB320-192 by restriction with Kpnl and 
Bglll. Instead, a BstXI cleavage site was introduced therein by 
35 means of synthetic oligonucleotides pi 99 and p200« 

pi 9 9 5 ' -GATCTCCAGTTAACTGGGGTAC-3 ' (SEQ ID NO: 22) 

p200 5 9 -CCCAGTTAACTGGA-3 ' (SEQ ID NO: 23) 

40 

DNA ends which are compatible with those originating by 
restriction with I-Ppol can now be generated by cleaving with 
BstXI. The resulting vector was renamed pCB322-l. 
A fragment which, at its one side, has an end which is compatible 
45 with DNA which had been cleaved with I-Ppol at its core 

recognition region and, at its other side, homology with plastome 
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7 / 

sequences of the master plants CB199NTH can be obtained from this 
vector for example using the enzymes BstXI and Sad. 

5.4 Generation of a transformation vector for artificial homing 
5 without homologous regions around the insertion sequence 

The remaining portion, which is homologous with recombinant 
plastid sequences of the master plants CB199NTH, was removed from 
the vector pCB322-l using Sacl and BspTI. Simultaneously, a BstXI 
10 cleavage site which, after cleavage with BstXI generates DNA ends 
which are compatible with I-Ppoi-cut DNA, was generated here by 
introducing synthetic oligonucleotides p218 and p219. The 
resulting vector was named pCB34 7-33. 

15 p218 5 ' -TTAAGCCAGTTAACTGGGCGGAGCT-3 ' (SEQ ID NO: 24) 
p219 5'-CCGCCCAGTTAACTGGC-3' (SEQ ID NO: 25) 

A fragment with bilateral DNA ends which are compatible with the 
20 DNA ends generated by the I-Ppol enzyme at its core recognition 
region can be isolated from this vector using the enzyme BstXI. 

Example 6: use of the master plants CB199NTH for plastid 
transformation by means of DSB induction 

25 

6.1 Using the vectors generated in 5.1 and 5.2 for transforming 
the master plants CB199NTH by using the DSBI enzyme 1-PpoI 

The plasmid pCB304-24 and the plasmid pCB289-13 were 
30 simultaneously applied to gold particles as described in 
Example 4 and used to bombard explants of the master plants 
CB199NTH, which explants had been treated analogously to what has 
been said in Example 4. However, the procedure differed from the 
decription in Example 4 insofar as incubation is first carried 
35 out for 10 days on the regeneration medium without antibiotics; 
later, kanamycin is used in a concentration of 50 mg/1 (in 
contrast to the 500 mg/1 spectinomycin stated in Example 4). 

The plasmid pCB320-192 was applied to gold particles as described 
40 in Example 4. After the ethanol washing step, 20 U of 

commercially available I-Ppoi enzyme (Promega) were additionally 
added. Further treatment was as described above. 

Also, in a different batch, 0.5 pg of a transcript generated in 
45 vitro with the aid of the T7 polymerase was applied 

simultaneously with the plasmid pCB320-192 to the gold particles. 
The template for the in-vitro transcription was 



0817/00028 



CA ^02470329 2004-06-14 



98 

Hindlll-linearized DNA of the plasmid pCB289-13. The transcript 
generated thus therefore encodes I-Ppol. After the bombardment , 
the treatment of the explants of the master plants continues as 
described above. 

5 

6.2 using the vectors generated in 5.1 and 5.3 for transforming 
the master plants CB199NTH by utilizing the DSBI enzyme 
I-Ppol and homologous regions which are only unilaterally 
present 

10 

A fragment excised from the plasmid pCB322-l with BstXI and SacI 
was eluted from an agarose gel. This fragment was subsequently 
applied to gold particles simultaneously with 1 \ig of in-vitro 
transcript of Hindlll-linearized plasmid pCB289-13 (cf . Example 
15 6.1). After the explants of the master plants CB199NTH have been 
bombarded, the rest of the treatment is as described for Example 
6.1. 

6.3 Using the vectors generated in 5.1 and 5.4 for transforming 
20 the master plants CB199NTH by utilizing the DSBI enzyme 

I-Ppol without homologous regions 

The insertion sequence was excised from the plasmid pCB347-33 by 
means of BstXI and eluted from an agarose gel. This fragment was 
25 applied to gold particles simultaneously with 1 ^g of in-vitro 
transcript of the Hindlll-linearized plasmid pCB289-13. The 
bombardment and the rest of the treatment are as detailed in 
Example 6.1. 

30 Example 7: Identification of naturally occurring, endogenous 
recognition regions for homing endonucleases 
in plastomes of different plant species 

Although no homing enconuc leases are known to occur in the 
35 plastids of higher plants, known plastome sequences were tested 
for the presence of recognition regions for homing endonucleases . 
This was done with the aid of the computer program SeqMan II 
(DNASTAR Inc.). The recognition regions which were identified in 
this manner are compiled in Table 1. 

40 

Based on the computer analysis, it was not possible to tell 
whether I-Scel has a recognition region in the plastidic genome 
or not. The region which is most likely to be able to act as 
recognition region was generated synthetically and integrated 
45 into the Xbal and Xhol cleavage site of pBluescript in the form 
of oligonucleotides p276 and p277. The resulting plasmid pCB414-l 
was subsequently analyzed with the aid of a commercially 
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available enzyme i-Scel (Roche) for the presence of a functional 
cleavage site. The plasmid was indeed linearized by I-Scel. This 
leads to the conclusion that I-Scel which is expressed in 
plastids likewise recognizes, and cleaves, this sequence. A 
5 further endogenous DSB recognition sequence for a DSBI enzyme has 
thus been identified. ^ 

p276 5 ' -TCGAGAAGATCAGCCTGTTATCCCTAGAGTAACT-3 ' (SEQ ID NO: 26) 

10 p277 5 ' -CTAGAGTTACTCTAGGGATAACAGGCTGATCTTC-3 ' (SEQ ID NO: 27) 

Example 8: Cloning homologous regions from the tobacco plastome 
which flank the endogenous recognition region for 
the homing endonuc lease I-Cpal 

15 

dna fragments from the 23S rDNA of the tobacco plastome were 
amplified by means of PCR upstream and downstream of the I-Cpal 
recognition region using the primers P 93 and p97, and p98 and 
p95, respectively. 

20 

p93: AAAGATCTCCTCACAAAGGGGGTCG (SEQ ID NO: 64) 

p97: TCGAAGACTTAGGACCGTTATAG (SEQ ID NO: 65) 

25 p98: AGGAAGACCTTGTCGGGTAAGTTCCG (SEQ ID NO: 66) 

p95: CTCAATTGGGGTCTCTCTGTCCAGGTGCAGG (SEQ ID NO: 67) 

The resulting fragments were used for constructing the vector 
30 pCB270-l. The fragment from BssHll to BssHII of the pBlueScript 
vector (SEQ ID NO: 8) is shown, the 5' end of the sequence 
indicated being located at the BssHII cleavage site which is 
closer to the 3' end of the lacZ gene. 

35 Two Bpil cleavage sites were introduced between the DNA fragments 
located upstream and downstream of the l-cpal recognition 
regions. Bpil generates overhangs which are outside their 
recognition region. This procedure ensured that the vector 
PCB270-1 could likewise be used for the subsequent ingegration of 

40 various introns. To this end, simple overhangs which are 

compatible with the ends generated by Bpil in the vector pCB270-l 
are generated at the introns to be cloned. Moreover, the 
respective nucleotides which are absent between the two fragments 
of the 23S rDNA in the vector pCB270-l are added onto the 

45 introns. The selected regions are so highly conserved that there 
is no need to amplify new regions from other plant species. 
Furthermore, a point mutation in the 23S rDNA, as has also been 
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found in lincomycin-resistant mutants, has been introduced into 
the sequence downstream of the I-Cpal recognition region via PCR 
strategy. The sequence of the vector pCB270-l which has been 
inserted into the pBluescript vector is shown in SEQ ID NO: 8. 
5 The sequence comprises the following elements: 

Fragment of the 23S rDNA upstream of the I-Cpal recognition 
region: nucleotides 37 to 194 

10 - Fragment of the 23SrDNA upstream of the I-Cpal recognition 
region: nucleotides 237 to 359 

Point mutation for lincomycin resistance: 352 (A being native 
at this point) 

15 

Vector pCB234-l is constructed just as vector pCB2 70-1, but 
additionally has a recognition region for each of the restriction 
enconuc leases Xhol and SacI downstream of the sequence shown 
hereinbelow. 

20 

Example 9: Cloning of the CpLSU2 intron including the homing 
endonuclease I-Cpal 

The CpLSU2 intron (SEQ ID NO: 15) was amplified from the DNA of 
25 the alga Chlamydomonas pallidostigmatica (Culture Collection of 
Algae at the University of Gdttingen, SAG Number 9.83, 
Chlamydomonas segnis, authentic strain of Chlamydomonas 
pallidostigmatica King) by means of PCR using the 
oligonucleotides p91 and p92. 

30 

p91 5 9 -AGAAGACGATCCTAAGG— 3 ' (SEQ ID NO: 28) 

p92 5 ' -TGAAGACTTGACAAGGAATTTCGC-3 ' (SEQ ID NO: 29) 

35 The oligonucleotides were chosen in such a way that cloning into 
the Bpil cleavage sites of the vector pCB234-l was possible, as 
described above. The sequence comprises the following elements: 

Position 9-17 - portion of the tobacco 2 3S rDNA which is 
40 absent in pCB234-l 

CpLSU2 intron: position 18-893 

I-Cpal ORF: position 377-835 

45 
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- Position 894-909 - portion of the tobacco 23S rDNA which is 
absent in pCB234-l 

This fragment, which comprises the CpLSU2 intron, was cloned into 
5 the backbone of the vector pGEMTeasy (Promega) (vector pCB14l-3). 
The entire fragment was excised from this vector using Bpii and 
cloned into the Bpil-linearized vector pCB234-l. The resulting 
vector was named pCB254-2. 3 

10 Example 10: Nuclear LSU-rRNA intron from Tetrahymena thermophila 

10.1 Cloning the LSU-rRNA intron from Tetrahymena thermophila 

The LSU-rRNA intron was amplified from the organism Tetrahymena 
15 thermophila by means of PCR. Again, the oligonucleotides pl02 and 
pi 03 were chosen in such a way that the nucleotides of the 
tobacco 23S rDNA, which are absent in pCB234-l, were added onto 
the intron to be amplified. 

20 pl02 (SBQ ID NO: 30): 

5 ' -AGAAGACGATCCTAAATAGCAATATTTACCTTT^GACCAAAAGTTATCAGGCATG-3 ' 
pl03 (SEQ ID NO: 31): 

5 'TGAAGACTTGACAAGGAATTTCGCTACCTTCGAGTACTCCAAAACTAATC-3 ' 

Moreover, the internal guide sequence (underlined in pl02) is 
mutated in such a way over the wild type, owing to the choice of 
the oligonucleotide pl02, that splicing of this intron at the 
desired position in the tobacco 23S rDNA is possible. The 
sequence shown in SEQ ID NO: 7 - the PCR fragment from the Bpii 
to Bpii cleavage site is shown - was cloned into the backbone of 
the vector pGEMTeasy. The resulting vector was named pCB220-17. 
The sequence comprises the following elements: 

35 - Position 9-12 - portion of the tobacco 23S rDNA which is 
absent in pCB234-l 

LSU intron: position 13-425 

40 - Position 426-446 - portion of the tobacco 23S rDNA which is 
absent in pCB234-l 

The Tetrahymena LSU intron including the added, flanking 
sequences, was excised from the vector pCB220-17 using Bpii and 
45 inserted into the Bpii cleavage sites of the vector pCB234-l. The 
resulting product was named pCB255-l. 



25 



30 
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10.2 Indirect detection of the splicing activity of the 
Tetrahymena LSU intron in E. coli 

To prove indirectly that the modified intron is indeed capable of 
5 splicing in the predetermined environment within the I-Cpal 
cleavage site, the modified Tetrahymena intron from pCB220-17 
together with a portion which surrounds the I-Cpal recognition 
region from the tobacco 23S rDNA was cloned in such a way into 
the lacz gene of pBluescript that, if this intron is spliced into 
10 E. coli (strain XLl-Blue), a functional lacZ peptide can be 

formed. The expression of the latter can be detected in suitable 
strains by methods with which the skilled worker is familiar by 
converting the substance 

5-bromo-4-chloro-3-indolyl-pD-galactopyranoside (X-Gal) in the 
15 medium into a blue pigment. This vector was named pCB315-l„ The 
lacZ gene including the introns in the vector pCB315-l is 
described by SEQ ID NO: 9. The vector backbone is identical with 
pBluescript. The sequence comprises the following elements: 

20 - lacZ-5' portion: complementary (789-765) 

multiple cloning site from pBluescript: complement axy 
(764-692) 

25 - 23S rDNA fragment upstream and including the I-Cpal 
recognition region: complementary (691-682) 

modified Tetrahymena intron: complementary (681-269) 

30 - 23S rDNA fragment upstream and including the I-Cpal 
recognition region: complementary (268-244) 

multiple cloning site from pBluescript: complementary 
(243-168) 

35 

lacZ-5' portion: complementary (167-1) 

A plasmid which corresponds to pCB315-l f but which plasmid 
(PCB305-1) lacks the element for the modified Tetrahymena intron, 

40 was generated for control purposes. pCB305-l thus acted as 
positive control to demonstrate that lacz, with the tobacco 
plastome 23S rDNA nucleotides incorporated in the reading frame 
is still functional. This reflects the situation after correct 
splicing of the Tetrahymena intron. XLl-Blue competent cells were 

45 transformed with the plasmids pCB315-l and pCB305-l by means of a 
method with which the skilled worker is familiar. In each case 
one individual colony was incubated on LB (Bactotryptone: 10 g/1, 
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yeast extract: 5 g/1, NaCl: 10 g/1, pH 7.5) plates comprising 
15 g/1 Bacto agar, 40 \ig/l ampicillin, 75 ug/1 IPTG 
(isopropyi-po-thiogalactopyranoside) and 80 ug/1 X-Gal overnight 
at 37°C. In fact, both clones turned blue, which suggests that the 
5 modified Tetrahymena intron was spliced in the non-natural envi- 
ronment of the tobacco 23S rDNA in the heterologous organismus E. 
coli . 

10.3: Introduction of further sequences into the Tetrahymena LSU 
10 intron 



in addition to the experiments in Example 10.2, the possibility 
of incorporating further elements into the modified Tetrahymena 
intron without destroying the splicing activity was studied. To 

15 this end, pCB315-l was linearized with Bglll and the overhangs 
were filled up with the aid of Klenow fragment. Then, an 
Xhol-Saci fragment as is found in pCB199-3 was therefore cloned 
into this vector, likewise after treatment with Klenow fragment. 
A 229 bp fragment was thus inserted into the modified intron by 

20 this cloning step. This fragment comprises an I-Ppol recognition 
region. Independently of the orientation in which the 229 bp 
fragment inserted into the Tetrahymena intron, a blue coloration 
was detected in the test as described in Example 10.2. This 
suggests that the Tetrahymena intron is capable both of splicing 

25 at the desired region in the 23S rDNA and of incorporating 
additional genetic information while nevertheless retaining a 
splicing activity. 

10.4 Transformation of a natural master plant and destruction of 
30 the endogenous I-Cpal recognition region with the modified 

Tetrahymena intron 

The modified Tetrahymena intron was excised from the vector 
PCB220-17 using Bpil and cloned into the Bpil linearized vector 
35 pCB234-l as described in principle in Example 8. The resulting 
vector was named pCB255-l. 

PCB255-1 is applied to gold particles simultaneously with in 
vitro transcript of pCB2€2-5 (linearized with Sail, using T7 
polymerase) by the method described in Example 4. These gold 
40 particles are subsequently used to bombard tobacco plants cv. 
Petit Havana analogously to the method described in Example 4. if 
appropriate, the explants can be selected on lincomycin f250 to 
500 mg/1). 



45 
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Example 11: LI • LtrB intron from Lactococcus lactis 

The Ll.LtrB intron including few bases of the flanking exon 
sequences was amplified from Lactococcus lactis by means of PCR 
5 using the primers p207 and p208. The PCR product was cloned into 
the vector pCR2.1TA (Invitrogen) (pCB345-34) and sequenced {SEQ 
ID NO: 10) • Few deviations in comparison with the published 
sequence were found. 

10 p207 5 ' -GAGAAGACATTCCTAACACATCCATAACGTGCG-3 ' (SEQ ID NO: 32) 

p208 5 ' -TGAAGACTTGACATTTGATATGGTGAAGTAGG-3 ' (SEQ ID NO: 33) 

The cloned fragment in pCB345-34 (from the EcoRI cleavage site to 
15 the EcoRI cleavage site of the pCR2.1TA vector) is represented in 
SEQ ID NO: 10. The remainder of the vector is identical with the 
backbone of pCR2.1TA. The sequence comprises the following 
elements: 

20 - Portion of the natural 5' exon: complementary (2540-2527) 
Intron Ll.LtrB: complementary (2526-35) 
ORF in the intron: complementary (1953-154) 

25 

Portion of the natural 3' exon: complementary (34-28) 

Example 12: Generation of a further derivative of the Tetrahymena 
LSU intron, and incorporation of a foreign gene into 
30 this intron derivative, and transformation into 

natural master plants 

In accordance with a preferred embodiment of the present 
invention, an artificial intron was generated, which intron can 

35 be incorporated into the plastidic genome at precisely the 
position where the natural intron belonging to the dsb 
recognition sequence under investigation resides. In the present 
example, the Tetrahymena LSU intron was modified in such a way 
that it is capable of splicing at the position marked at the 

40 recognition site, identified within the scope of the present 

invention, for the DSBI enzyme I-Cpal in the plastidic genome of 
higher plants: CGATCCTAAGGT* AGCGAAATTCA • 

The gene encoding I-Cpal including an RBS was subsequently 
45 incorporated into the intron. This gave rise to an intron which 
has splicing activity, bears a foreign gene and which can be 
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incorporated, by means of the process found within the scope of 
the present invention, into the plastids of a natural master 
plant within an essential gene (encoding the 23S rRNA) . 

5 12.1: Generation of a further Tetrahymena LSD intron derivative 

To obtain a functional intron derivative at a predefined 
insertion site, the internal guide sequence (ICS) must be adapted 
in such a way that it is capable of undergoing base pairing with 

10 the 5' and the 3' exon. Pig. 10 illustrates how this adaptation 
was carried out in the present example in order to generate a 
Tetrahymena intron derivative which is capable of splicing at the 
natural insertion site of the CpLSUS intron within the I-Cpal 
recognition region. An adaptation to any desired insertion site 

15 can be carried out analogously. The intron generated within the 
scope of the present example was named TetIVS2a and is described 
by SEQ ID NO: 73. scnoea 

12.2: Indirect detection of the splicing activity of the 
20 TetlVS2a intron in E. coli 

Tetivs2a was incorporated into the lacz gene of pBluescript 
analogously to Example 10.2. After suitable incubation of E. coli 
XLl-blue cells which comprised the plasmid pCB459-l, a blue 
25 ^loration indicated the splicing activity of the TetIVS2a intron 
at the desired position. 

Components of the insert from plasmid pCB459-l (SEQ id NO: 74- 
backbone corresponds to pBluescript) ' 

30 

- lac2-3' portion including parts of the multiple cloning site 
from pBluescript (complementary to position 1-254) 

- Sequence from the I-Cpal recognition region (complementary to 
position 254-265) x 

35 - TetIVS2a (complementary to position 266-678) 

- Sequence from the I-Cpal recognition region (complementary to 
position 679-687) * 

- lacZ-5' portion including parts of the multiple cloning site 
from pBluescript (complementary to position 688-791) 

40 

12.3: Introduction of further genetic information into the 

TetIVS2a intron and detection of the splicing activity in 
E. coli 

45 In this example, the gene encoding the DSBI enzyme I-Cpal is 
introduced into TetlvS2a without the latter losing its splicing 
activity at said position within the I-Cpal recognition region. 
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To this end, a Bell cleavage site was first introduced, by PCR, 
into the sequence segment of TetIVS2a which corresponds to loop 
L8 in the Tetrahymena LSU intron. A nonfunctional derivative of 
the gene encoding I-Cpal was then incorporated into this Bell 
5 cleavage site. Since the expression of I-Cpal in E. coli is 

toxic, it was necessary to use, for the splice test in E.coli, a 
nonfunctional gene which had previously been generated by 
linearizing the gene in question at the EcoRI cleavage site, 
making the overhangs blunt-ended by Klenow treatment and 

10 subsequently religating the gene segments. This resulted in a 
reading-frame shift in the gene. Incorporation of said intron 
with the nonfunctional gene into the lacZ gene of pBluescript 
gave rise to the plasmid pCB478-3 and, again, it was possible to 
detect the splicing activity of this intron in E • coli at the 

15 desired position within the I-Cpal recognition site by means of 
the blue coloration of colonies in question, analogously to 
Example 12.2. Since the functional gene encoding I-Cpal differs 
from the nonfunctional gene used in pCB478-3 by only 4 bases, it 
can be assumed that the intron retains the desired splicing 

20 activity, even after the functional, instead of the 
nonfunctional, I-Cpal gene has been incorporated. 

Components of the insert of plasmid pCB4 78-3 (SEQ ID NO: 75; 
backbone corresponds to pBluescript) 

25 

lacZ-3' portion including parts of the multiple cloning site 
from pBluescript (complementary to position 1-265} 
Sequence from the I-Cpal recognition region ( complementary to 
position 256-265) 
30 - TetIVS2a (complementary to position 266-1178), comprising a 
nonfunctional gene for I-Cpal (complementary to position 
399-861) and an RSB upstream of the nonfunctional I-Cpal gene 
(complementary to 866-870) 

Sequence from the I-Cpal recognition region (complementary to 
35 position 1179-1187) 

- lacZ-5' portion including parts of the multiple cloning site 
from pBluescript (complementary to position 1179-1291) 

12.4: Transformation of a self -disseminating, artificial intron 
40 in a natural master plant 

After it had been demonstrated that the TetIVS2a intron is 
capable of splicing at the desired position and of simultaneously 
incorporating further genetic information without losing this 
45 splicing activity, a construct was generated which is intended to 
make possible that the I-Cpal gene can be introduced into the 
plastidic genome by means of the method described within the 



0817/00028 



CA 02470329 2004-06-14 



107 

scope of the present invention in the form of a foreign gene 
without using a selection marker. To this end, the vector 
PCB492-2S, which comprises an insert with the following elements 
was first generated (SEQ ID NO: 76; backbone corresponds to that 
5 of pBluescript; sequence from BssHll to BssHII in pBluescript is 
indicated, the BssHII cleavage site indicated here at the 5' end 
is the BssHII cleavage site in pBluescript which is localized 
closer to the 3' end of the lacz gene): 

10 - 23S rDNA fragment upstream of and including the I-Cpal 
recognition region (position 37-203) 

- TetIVS2a (position 204-1112) with inserted gene encoding 
I-Cpal (position 521-979) and RBS (position 512-516) 
23S rDNA fragment downstream of and including the I-Cpal 

15 recognition region (position 1113-1247) 

To ensure expression of I-Cpal directly after the introduction 
into plastids of natural master plants, a promoter was added in 
vitro upstream of said intron Cpa construct by means of PCR. The 
20 primers p396 and P 95 and, as template, pCB492-25 were used for 
this purpose. 

p396 (SEQ ID NO: 77): 

5 ' -TAGTAAATGACAATTTTCCTCTGAATTATATAATTAACATGGCGACTGTTTACCAAAAAC-3 

25 

p95 (SEQ ID NO: 78): 5 ' -CTCAATTGGGGTCTCTCTGTCCAGGTGCAGG-3 ' 

The resulting PCR product was named Prom-TetIVS2a-Cpa, is 
described by SEQ ID NO: 79 and comprised the following elements: 

30 

synthetic promoter (position 8-40) 

tobacco 23S rDNA upstream of and including portions of the 
I-Cpal recognition region (position 41-207) 

- TetIVS2a (position 208-1116) comprising gene encoding I-Cpal 
35 (position 525-983) and RBS (position 516-520) 

tobacco 23S rDNA downstream of and including portions of the 
I-Cpal recognition region (position 1117-1243) 

The plasmid pCB492-25 was applied to gold particles 
40 simultaneously with the above-described PCR product 

Prom-TetIVS2a-C P a as described in Example 4 and subsequently used 
to bombard wild-type tobacco. By expression of the I-Cpal enzyme, 
it was intended to bring about a double-strand break in the 
23S rDNA, which double-strand break is repaired by the PCR 
45 product which has been introduced or by the insertion sequence, 
of the plasmid pCB4 92-25, which has been introduced. The I-Cpal 
recognition region, which is naturally present, is thereby 
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inactivated in the transformed plastome copies. Plants were 
regenerated without any selection pressure, and these plants are 
tested by PCR for the presence of the insertion sequence in the 
plastome • 

5 

Example 13: Dissemination of the modified Tetrahymena LSU intron 
from pCB255-l in a natural master plant by expression 
of the DSBI enzyme I-Cpal in trans 

10 This example shows how a DSBI enzyme can be expressed in the 

plastids of master plants in order to efficiently disseminate an 
insertion sequence in the copies of the master plant. 

13.1: Generation of a vector for the transformation of plastids 
15 which permits the expression of the homing endonuclease 

I-Cpal in plastids 

First, a vector which encodes the selection marker aadA and the 
DSBI enzyme I-Cpal was generated. Since expression of the I-Cpal 

20 enzyme is lethal in E. coli, the accD promoter was chosen in 

order to allow for the expression of this enzyme in the plastids, 
but to prevent the expression in E. coli. Thus, it was possible 
to generate and amplify this vector in a conventional manner with 
E • coli as the host organism. The resulting vector was named 

25 pCB435-45 and comprised an insert as shown in SEQ ID NO: 80 with 
the following elements: 



- Right-hand target region (as in pCB42-94, see above; 
complementary to position 66-1403) 
30 - promoter PaccD (position 1422-1478) 
RBS (position 1500-1504) 

Gene encoding I-Cpal (position 1509-1967) 

Expression cassette for the marker gene aadA consisting of: 
the 3' region of the psbA gene (complementary to position 

35 2065-1974) 

aadA gene (complementary to position 2872-2078) 
5 '-untranslated regions of the tobacco rbcL gene 
(complementary to position 2890-2873), partly mutated 
Promoter of the gene for the 16S rRNA (complementary to 

40 position 2987 to 2897) 

Left-hand target region (as in pCB42-94, see above; 
complementary to position 3863-3007) 

Portions of the pBluescript (including origin of replication; 
positions 3864-4746 and 1-65) 



45 
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13.2 ^transformation of pCB435-45 and pCB255-l into natural 
master plants 

The plasmids pCB435-45 and pCB255-l were applied simultaneously 
5 to gold particles as detailed in Example 4 and then introduced 
into plastids of tobacco leaves by means of the particle gun 
Transplastomic plants were selected on regeneration medium 
supplemented with 500 mg/1 spectinomycin as described in Example 
4. As soon as plantlets had formed, they were transferred to 

10 growth medium supplemented with 500 mg/1 spectinomycin, and leaf 
material was harvested. This leaf material was analyzed by 
Southern analysis using the Dig-Easy Hyb* (Roche Diagnostics- 
Mannheim) for the incorporation of the two plasmids into the' 
plastidic genome. A probe with a sequence as shown in SEQ ID NO* 

15 81 was used to determine the percentage of plastome copies which 
were transgenic with regard to the insertion sequence from 
PCB435-45 (probe directed against portions of the 16S rDNA). 

A probe with a sequence as shown in SEQ id NO: 82 was used to 
20 determine the percentage of plastome copies which were transgenic 
with regard to the insertion sequence from P CB255-1 (probe 
directed against portions of the 23S rDNA). 

Fig. 11 shows that, in this experiment, 2 lines (CB255+435NTH-19 
25 and -20) were identified which are transgenic with regard to both 
the insertion sequence of pCB435-45 and that from pCB255-l It 
was furthermore demonstrated in this manner that, surprisingly, 
the insertion sequence from pCB255-l (modified Tetrahymena LSU 
intron) had already been disseminated into more copies of the 
30 plastidic genome than the insertion sequence from DCB435-45, even 
though the selection had been carried for the event of the 
insertion of the insertion sequence from pCB435-45 (aadA marker 
gene resides in pCB435-45). The efficiency of the method 
described within the present invention - viz. the insertion and 
35 rapid dissemination of an insertion sequence in the plastidic 
genome without selecting for the presence of this insertion 
sequence by utilizing DSBI enzymes and suitable recognition sites 
- has thus been demonstrated for said lines in the present 
example . 

40 



45 
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Example 14 : Generation of further master plants with a DSB 

recognition region which does not naturally occur in 
plastids, and transformation of these plants 
utilizing the DSBI enzyme I-Ppol 

5 

14.1: Generation of a further vector (pCB456-2) for introducing 
a non-naturally-occurring recognition region for the 
homing endonuclease I-Ppol into the plastome of tobacco 

10 The purpose of this approach was (analogously to Example 3) to 
generate a further vector for the transformation of plastids, 
which vector has no extensive homologies with sequences in the 
plastidic genome, 

15 In this plasmid, the selection marker aadA is under the control 
of a synthetic promoter which is derived from the consensus 
sequence for E • coli o70 promoters. A region downstream of the 
Synechocystis 3'psbA-l gene was used as the 3' end. In contrast 
to the vector pCBl99-3 which has already been described, the DSB 

20 recognition sequence was here introduced into the molecule 
immediately downstream of the aadA gene, but upstream of the 
Synechocystis 3'psbA-l sequence. An operon can be generated 
thereby with the aid of a DSBI enzyme following insertion of an 
insertion sequence. The genes on the insertion sequence can then 

25 be optionally inserted on the insertion sequence without 

promoter. After the insertion, suitable genes of the insertion 
sequence then also come under the control of the synthetic 
promoter upstream of the aadA gene in the master plant. An operon 
structure consisting of the aadA and the subsequently introduced 

30 genes can thereby optionally be generated in the plastome. 

Various elements were cloned one after the other into the basic 
vector pCB42-94 in order to generate the plasmid pCB456-2: 

35 - Synthetic promoter (complementary to bp 1226-1260) 

Ribosome binding site (complementary to bp 1214-1218) 
aadA gene (complementary to bp 414-1208) 

Core recognition region for the homing endonuclease I-Ppol 
(complementary to bp 331-345) 
40 - 3'psbA-l from Synechocystis (complementary to bp 19-155) 

The vector thus obtained also confers spectinomycin resistance in 
E. coli. This vector, which is named pCB456-2, comprises the 
abovementioned elements within the nucleic acid sequence with the 
45 SEQ ID NO: 83, instead of the multiple cloning site in the basic 
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vector PCB42-94 for the transformation of plastids. Again, all of 
the sequence which replaces MCS (from Sad to Kpnl) is indicated. 

14.2: Generation of predominantly homotransplastomic master 
5 plants which comprise a nonnatural dsb recognition 

sequence 

The vector pCB456-2 was introduced into the plastids of tobacco 
analogously to pCB199-3 in Example 4. However, as opposed to the 

10 description in Example 4, the shoots obtained were grown on 
growth medium comprising 30 g/1 sucrose (instead of the 10 g/l 
stated in Example 4). The resulting plants were named CB456NTH 2 
lines which have the insertion sequence from pCB456-2 
incorporated into their plastome (CB456NTH-1 and -15, cf 

15 Fig. 12) were identified among the spectinomycin-resistant plants 
obtained after the transformation, using Southern hybridization 
A probe which was directed against a fragment of the 16S rDNA was 
employed in the Southern experiment (cf. Example 13.2 above) 
This probe was suitable for detecting an approx. 3.1 kb fragment 

20 from EcoRI-digested DNA corresponding to the wild type. In 
contrast, an approx. 1.7 kb fragment was detected when the 
insertion sequence from pCB456-2 had been incorporated into the 
corresponding plastome copies. 

25 14.3: Generation of a transformation vector for artificial 
homing in the master plants CB456NTH 

First, an operon structure consisting of the elements rbs - nptll 
(encoding an enzyme which confers kanamycin resistance) - rbs - 

30 I-P P oI (encoding a DSBI enzyme) was generated. This cassette was 
surrounded with Bstxi cleavage sites which, after exposure to the 
enzyme BstXI, generate DNA ends which are compatible with the DNA 
ends generated by the enzyme I-Ppol. The resulting vector 
(backbone corresponds to that of pBluescript) was named pCB528-2 

35 and comprises an insert as shown in SEQ id NO: 84 with the 
following elements: 

RBS (position 28-32) 
nptll (position 27-840) 
40 - RBS (position 849-853) 

Gene encoding I-ppol (position 859-1350) 

The 1360 bp fragment was subsequently excised from PCB528-2 using 
BstXI and ligated into the I-Ppol cleavage site in the vector 
45 pCB456-2. Clones with kanamycin resistance were selected from 
those obtained after the ligation products had been transformed 
into E. coli. it was thereby ensured that said insert in the 
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clone in question was inserted in the vector in such a way that 
the nptll and I-Ppol cassettes had the same orientation as the 
aadA cassette. This was also verified by the restriction analysis 
method, with which the skilled worker is familiar. The vector in 
5 question was named pCB535-ll. 

14.4: Transformation of pCB535-ll into master plants CB456NTH 
pCB535-ll was applied to gold particles as described for pCB456-2 
in Example 14.2 and subsequently introduced into plastids of the 
10 master plant CB456NTH-1 using the particle gun. 

Some of the explants were incubated on regeneration medium 
without any selection pressure. Resulting plants were transferred 
to growth medium (again without selection pressure). Therafter, 
15 the plants are analyzed by PGR for the presence of the RBS - 
nptll — RBS — I-Ppol cassette. 

Other explants were incubated on regeneration medium supplemented 
with 15 or 30 mg/1 kanamycin. After 2 weeks, the plants are 
20 transferred to fresh regeneration medium and the kanamycin 

concentration increased stepwise to 50 and 80 mg/1, respectively. 



25 



30 



35 



40 



45 
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SEQUENCE LISTING 
<110> SunGene GmbH & Co.KGaA 

<120> Method for the transfonnatioa of vegetal plastids 

<130> AE20010854 / PF 0081700028 

<140> 
<141> 

<160> 84 

<170>PatentIn'Ver. 2.1 

<210> 1 
<211> 4363 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: plastid 
transformation plasmid pCB42-94 

<220> 

<221>misc feature 

<222> (55)^1405) 

<223> right targeting region 

<220> 

<221> mutation 
<222> (346) 

<223> mutation causing streptomycin resistance 
<220> 

<221> mutation 
<222> (68) 

<223> mutation causing spectinomycin resistance 
<220> 

<221>misc feature 
<222> (1404)..(1511) 
<223> multiple cloning site 

<220> 

<221> misc feature 
<222> (2629)..(3417) 
<223> Ampicillin resistance 

<400> 1 

gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagtgatga 60 
cttgacggca tcctcacctt cctccggctt atcaccggca gtctgttcag ggttccaaac 120 
tcaacgatgg caactaaaca cgagggttgc gctcgttgcg ggacttaacc caacacctta 180 
cggcacgagc tgacgacagc catgcaccac ctgtgtccgc gttcccgaag gcacccctct 240 
ctttcaagag gattcgcggc atgtcaagcc ctggtaaggt tcttcgcttt gcatcgaatt 300 
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aaaccacatg ctccaccgct tgtgcgggcc cccgtcaatt ccttttagtt tcattcttgc 360 
gaacgtactc cccaggcggg atacttaacg cgttagctac agcactgcac gggtcgatac 420 
gcacagcgcc tagtatccat cgtttacggc taggactact ggggtatcta atcccattcg 480 
ctcccctagc tttcgtctct cagtgtcagt gtcggcccag cagagtgctt tcgccgttgg 540 
tgttctttcc gatctctacg catttcaccg ctccaccgga aattccctct gcecctaccg 600 
tactccagct tggtagtttc caccgcctgt ccagggttga gccctgggat ttgacggcgg 660 
acttaaaaag ccacctacag acgctttacg cccaatcatt ccggataacg cttgcatcct 720 
ctgtattacc gcggctgctg gcacagagtt agccgatgct tattccccag ataccgtcat 780 
tgcttcttct ccgggaaaag aagttcacga cccgtgggcc ttctacctcc acgcggcatt 840 
gctccgtcag gctttcgccc attgcggaaa attccccact gctgcctccc gtaggagtct 900 
■ gggccgtgtc tcagtcccag tgtggctgat catcctctcg gaccagctac tgatcatcgc 960 
cttggtaagc tattgcctca ccaactagct aatcagacgc gagcccctcc tcgggcggat 1020 
tcctcctttt gctcctcagc ctacggggta ttagcagccg tttccagctg ttgttcccct 1080 
cccaagggca ggttcttacg cgttactcac ccgtccgcca ctggaaacac cacttcccgt 1140 
ccgacttgca tgtgttaagc atgccgccag cgttcatcct gagccaggat cgaactctcc 1200 
atgagattca tagttgcatt acttatagct tccttgttcg tagacaaagc ggattcggaa 1260 
ttgtctttca ttccaaggca taacttgtat ccatgcgctt catattcgcc cggagttcgc 1320 
tcccagaaat atagccatcc ctgccccctc acgtcaatcc cacgagcctc ttatccattc 1380 
tcattgaacg acggcggggg agcgagctcc accgcggtgg cggccgctct agaactagtg 1440 
gatcccccgg gctgcaggaa ttcgatatca agcttatcga taccgtcgac ctcgaggggg 1500 
ggcccggtac caaatccaac tagaaaaact cacattgggc ttagggataa tcaggctcga 1560 
actgatgact tccaccacgt caaggtgaca ctctaccgct gagttatatc ccttccccgc 1620 
cccatcgaga aatagaactg actaatccta agtcaaaggg tcgagaaact caacgccact 1680 
attcttgaac aacttggagc cgggccttct tttcgcacta ttacggatat gaaaataatg 1740 
gtcaaaatcg gattcaattg tcaactgccc ctatcggaaa taggattgac taccgattcc 1800 
gaaggaactg gagttacatc tcttttccat tcaagagttc ttatgcgttt ccacgcccct 1860 
ttgagacccc gaaaaatgga caaattcctt ttcttaggaa cacatacaag attcgtcact 1920 
acaaaaagga taatggtaac cctaccatta actacttcat ttatgaattt catagtaata 1980 
gaaatacatg tcctaccgag acagaatttg gaacttgcta tcctcttgcc tagcaggcaa 2040 
agatttacct ocgtggaaag gatgattcat tcggatcgac atgagagtcc aactacattg 2100 
ccagaatcca tgttgtatat ttgaaagagg ttgacctcct tgcttctctc atggtacact 2160 
cctcttcccg ccgagcccct tttctcctcg gtccacagag acaaaatgta ggactggtgc 2220 
caacaattca tcagactcac taagtcggga tcactaacta atactaatct aatataatag 2280 
tctaatatat ctaatataat agaaaatact aatataatag aaaagaactg tcttttctgt 2340 
atactttccc cggttccgtt gctaccgagg gcctcgtgat acgcctattt ttataggtta 2400 
atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg 2460 
gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat 2520 
aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc 2580 
gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa 2640 
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac 2700 
tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga 2760 
tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag 2820 
agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca 2880 
cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca 2940 
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa 3000 
ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc 3060 
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tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa 3120 
cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag 3180 
actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct 3240 
ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac 3300 
tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa 3360 
ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt 3420 
aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat 3480 
ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg 3540 
agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc 3600 
ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg 3660 
tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag 3720 
cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact 3780 
ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg 3840 
gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc 3900 
ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg 3960 
aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg 4020 
cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag 4080 
ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc 4140 
gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct 4200 
ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc 4260 
ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc 4320 
gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga aga 4363 
<210> 2 
<211> 1359 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: Insert of 
vector pCB199-3 

<400>2 

gagctctgat cacggaagat agctttggca aaaaaagcaa aaagcattta ccttgattga 60 
gatgttaatt gtgttggcaa ttatcagtat tttaattttg ctttttgtgc caaatttgat 120 
actagagctt cgggtgccag ggcgtgccct tgggctcccc gggcgcgtac tcgacgctac 180 
cttaagagag tcaagcttct atattaccct gttatcccta gcgtactcga gaaaaaaaga 240 
aaggagcaat agcaccctct tgatagaaca agaaaatgat tattgctcct ttcttttcaa 300 
aacctcctat agactaggcc aggaattatc tgcagttatt tgccaactac cttagtgatc 360 
tcgcctttca cgtagtggac aaattcttcc aactgatctg cgcgcgaggc caagcgatct 420 
tcttcttgtc caagataagc ctgtctagct tcaagtatga cgggctgata ctgggccggc 480 
aggcgctcca ttgcccagtc ggcagcgaca tccttcggcg cgattttgcc ggttactgcg 540 
ctgtaccaaa tgcgggacaa cgtaagcact acatttcgct catcgccagc ccagtcgggc 600 
ggcgagttcc atagcgttaa ggtttcattt agcgcctcaa atagatcctg ttcaggaacc 660 
ggatcaaaga gttcctccgc cgctggacct accaaggcaa cgctatgttc tcttgctttt 720 
gtcagcaaga tagccagatc aatgtcgatc gtggctggct cgaagatacc tgcaagaatg 780 
tcattgcgct gccattctcc aaattgcagt tcgcgcttag ctggataacg ccacggaatg 840 
atgtcgtcgt gcacaacaat ggtgacttct acagcgcgga gaatctcgct ctctccaggg 900 
gaagccgaag tttccaaaag gtcgttgatc aaagctcgcc gcgttgtttc atcaagcctt 960 
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acggtcaccg taaccagcaa atcaatatca ctgtgtggct tcaggccgcc atccactgcg 1020 
gagccgtaca aatgtacggc cagcaacgtc ggttcgagat ggcgctcgat gacgccaact 1080 
acctctgata gttgagttga tacttcggcg ataaccgctt cacgagccat ggctgtttgt 1140 
ggtgtcatgg ctgtttgtgg tgtcatgaat ccctccctac aactagatcc tcgcccggag 1200 
ttcgctccca gaaatatagc catccctgcc ccctcacgtc aatcccacga gcctcttatc 1260 
cattctcatt gaacgacggc gtcgaggggg ggcccggtac gtcgacgaag ttcctattcc 1320 
gaagttccta ttctcaagaa agtataggaa cttcgtacc 1359 

<210> 3 
<211> 1070 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
vector pCB401-20 

<400>3 

gagctctgat cacggaagat agctttggca aaaaaagcaa aaagcattta ccttgattga 60 
gatgttaatt gtgttggcaa ttatcagtat tttaattttg ctttttgtgc caaatttgat 120 
actagagctt cgggtgccag ggcgtgccct tgggctcccc gggcgcgtac tcgacgctac 180 
cttaagagag tcaagcttct atattaccct gttatcccta gcgtactcga gctgcagtta 240 
tttgccaact accttagtga tctcgccttt cacgtagtgg acaaattctt ccaactgatc 300 
tgcgcgcgag gccaagcgat cttcttcttg tccaagataa gcctgtctag cttcaagtat 360 
gacgggctga tactgggccg gcaggcgctc cattgcccag tcggcagcga catccttcgg 420 
cgcgattttg ccggttactg cgctgtacca aatgcgggac aacgtaagca ctacatttcg 480 
ctcatcgcca gcccagtcgg gcggcgagtt ccatagcgtt aaggtttcat ttagcgcctc 540 
aaatagatcc tgttcaggaa ccggatcaaa gagttcctcc gccgctggac ctaccaaggc 600 
aacgctatgt tctcttgctt ttgtcagcaa gatagccaga tcaatgtcga tcgtggctgg 660 
ctcgaagata cctgcaagaa tgtcattgcg ctgccattct ccaaattgca gttcgcgctt 720 
agctggataa cgccacggaa tgatgtcgtc gtgcacaaca atggtgactt ctacagcgcg 780 
gagaatctcg ctctctccag gggaagccga agtttccaaa aggtcgttga tcaaagctcg 840 
ccgcgttgtt tcatcaagcc ttacggtcac cgtaaccagc aaatcaatat cactgtgtgg 900 
cttcaggccg ccatccactg cggagccgta caaatgtacg gccagcaacg tcggttcgag 960 
atggcgctcg atgacgccaa ctacctctga tagttgagtt gatacttcgg cgataaccgc 1020 
ttcacgagcc atggttccct ccctacaacg tcgagggggg gcccggtacc 1070 

<210>4 
<211> 804 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
vector pCB289- 13 

<220> 
<221> CDS 
<222> (154)..(642) 

<223> sequence coding for I-Ppol homing-cndonuclease 
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<400> 4 

ggtaccgggc cccccctcga cgccgtcgtt caatgagaat ggataagagg ctcgtgggat 60 
tgacgtgagg gggcagggat ggctatattt ctgggagcga actccgggcg aggatctagt 120 
tgtagggagg gattcatgac accacaaaca gcc atg gcg etc acc aat get caa 174 
Met Ala Leu Thr Asn Ala Gin 
1 5 

ate ttg get gtg att gac agt tgg gaa gaa aca gtc ggt cag ttt cca 222 
lie Leu Ala Val De Asp Ser Trp Glu Glu Thr Val Gly Gin Phe Pro 
10 15 20 

gtg ata acg cac cat gta cca tta ggt ggc ggt ctg caa gga acg etc 270 
Val De Thr His His Val Pro Leu Gly Gly Gly Leu Gin Gly Thr Leu 
25 30 35 

cat tgt tac gag ate ccc eta gca get cct tat ggg gtt ggc m get 318 
His Cys Tyr Glu De Pro Leu Ala Ala Pro Tyr Gly Val Gly Phe Ala 
40 45 50 55 

aag aat ggg cct acc cgc tgg caa tac aaa egg aca ate aat caa gtc 366 
Lys Asn Gly Pro Thr Arg Trp Gin Tyr Lys Arg Thr lie Asn Gin Val 
60 65 70 

gtc cac aga tgg ggg tec cac aca gtc cct ttt eta tta gaa ccg gat 414 
Val His Arg Trp Gly Ser His Thr Val Pro Phe Leu Leu Glu Pro Asd 
75 80 85 

aac ate aac ggc aaa acc tgc aca gca teg cac eta tgt cat aat act 462 
Asn He Asn Gly Lys Thr Cys Thr Ala Ser His Leu Cys His Asn Thr 
90 95 100 

cga tgc cac aat ccc ttg cac ttg tgc tgg gag tea eta gac gac aac 510 
Arg Cys His Asn Pro Leu His Leu Cys Trp Glu Set Leu Asp Asp Asn 
110 115 

aaa ggc aga aac tgg tgc ccc ggt ccc aac ggg gga tgt gtc cat gcg 558 
Lys Gly Arg Asn Trp Cys Pro Gly Pro Asn Gly Gly Cys \al His Ala 
120 125 130 135 

gtg gtt tgt tta agg cag ggt ccg ttg tac ggc cca ggg gcg act gtg 606 
Val Val Cys Leu Arg Gin Gly Pro Leu Tyr Gly Pro Gly Ala Thr Val 
140 145 150 

gca ggt cct caa caa agg ggc agt cac ttt gtg gta taactgeaga 652 
Ala Gly Pro Gin Gin Arg Gly Ser His Phe Val Val 
155 160 

agctteaatt geatgetcta gatgatcaaa gaattcctgg cctagtetat aggaggtttt 712 
gaaaagaaag gagcaataat cattttcttg ttctatcaag agggtgetat tgctccttte 772 
tttttttctc gagaggggta cgtaccgagc te 804 
<210> 5 
<211> 163 



<A 02470329 2004-06-14 



0817/00028 

6 

<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: insert of 
vector pCB289-13 

<400> 5 

Met Ala Leu Thr Asn Ala Gin lie Leu Ak Val He Asp Ser Trp Glu 
1 5 10 15 

Glu Thr Val Gly Gin Phe Pro Val He Thr His His Val Pro Leu Gly 
20 25 30 

Gly Gly Leu Gin Gly Thr Leu His Cys Tyr Glu lie Pro Leu Ala Ala 
35 40 45 

Pro Tyr Gly Val Gly Phe Ala Lys Asn Gly Pro Thr Arg Trp Gin Tyr 
50 55 60 

Lys Arg Thr De Asn Gin Val Val His Arg Trp Gly Ser His Thr Val 
65 70 75 80 

Pro Phe Leu Leu Glu Pro Asp Asn He Asn Gly Lys Thr Cys Thr Ala 
85 90 * 95 

Ser His Leu Cys His Asn Thr Arg Cys His Asn Pro Leu His Leu Cys 
100 * 105 110 

Trp Glu Ser Leu Asp Asp Asn Lys Gly Arg Asn Trp Cys Pro Gly Pro 
115 120 125 

Asn Gly Gly Cys Val His Ala Val Val Cys Leu Arg Gin Gly Pro Leu 
130 135 140 

Tyr Gly Pro Gly Ala Thr Val Ala Gly Pro Gin Gin Arg Gly Ser His 
145 150 155 160 

Phe Val Val 



<210> 6 
<211> 1150 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
Xhol / Bgl n fragment for construction of 
vector pCB304-25 

<400>6 

ctcgagtatt cggctcaatc cttttagtaa aagattgggc cgagtttaat tgcaattcaa 60 
ttaagagaac gaaggataat tacttgagtt ctttctcctt atccttcttt atttcctgct 120 
aatttatctg ctaatgtcta ctgaattcaa gcttggatcc gcggccgcct agcttgggtc 180 
ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg 240 
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cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat 300 
cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga 360 
tgaatccaga aaagcggcca ftttccacca tgatattcgg caagcaggca tcgccatggg 420 
tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg 480 
gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc 540 
gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat 600 
caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctag gcaggagcaa 660 
ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg 720 
cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata 780 
gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa 840 
gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct 900 
gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca 960 
atccatcttg ttcaatccaa gctcccatgg ctgtttgtgg tgtcatgaat ccctccctac 1020 
aactagatcc tatatactat agagataggt ggataaatat ttttctttag taagacccca 1080 
tcgctaatat taatttatct aacatattaa ttaatattta atatataaat atatatagag 1140 
tcgagatcta 1150 

<210> 7 
<211> 454 
<212>DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 

vector pCB220- 17 
<400>7 

gaagacgatc ctaaatagca atatttacct ttgggaccaa aagttatcag gcatgcacct 60 
ggtagctagt ctttaaacca atagattgea tcggtttaaa aggcaagacc gtcaaattgc 120 
gggaaagggg tcaacagccg ttcagtacca agtctcaggg gaaactttga gatggccttg 180 
caaagggtat ggtaataagc tgaeggacat ggtcctaacc acgcagccaa gtcctaagtc 240 
aacagatctt ctgttgatat ggatgcagtt cacagactaa atgtcggtcg gggaagatgt 300 
attcttctca taagatatag tcggacctct ccttaatggg agetagegga tgaagtgatg 360 
caacactgga gccgctggga actaatttgt atgcgaaagt atattgatta gttttggagt 420 
actcgaaggtagcgaaattccttgtcaagtcttc 454 
<210>8 
<211> 498 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
vector pCB270-l 

<400>8 

gcgcgcgtaa tacgactcac tatagggega attgggggcg actgtttacc aaaaacacag 60 
gtctccgcaa agtegtaaga ccatgtatgg gggctgaege ctgcccagtg ccggaaggtc 120 
aaggaagttg gtgacctgat gacaggggag ccggcgaccg aagccccggt gaaeggegge 180 
cgtaactata aeggtcctaa gtcttcgaat cgaattcact agtgatagga agaccttgtc 240 
gggtaagttc cgacccgcac gaaaggcgta acgatctggg cactgtctcg gagagaggct 300 
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cggtgaaata gacatgtctg tgaagatgcg gactacctgc acctggacag agagacccca 360 
attcctggcc tagtctatag gaggttttga aaagaaagga gcaataatca ttttcttgtt 420 
ctatcaagag ggtgctattg ctcctttctt tttttctcga ccagcttttg ttccctttag 480 
tgagggttaa ttgcgcgc 498 

<210> 9 
<211>789 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: lacZ gene 
with inserted intron to demonstrate splicing 
(from vector pCB315-l) 

<400> 9 

acaatttcca ttcgccattc aggctgegea actgttggga agggegateg gtgegggect 60 
ettegctatt acgccagctg gcgaaagggg gatgtgctgc aaggegatta agttgggtaa 120 
cgccagggtt ttcccagtca cgacgttgta aaacgaegge cagtgagege gcgtaatacg 180 
actcactata gggcgaattg ggtaceggge cccccctcga ggtcgacggt atcgataagc 240 
ttgacccgac aaggaatttc gctaccttcg agtactccaa aactaatcaa tatactttcg 300 
catacaaatt agttcccagc ggctccagtg ttgeatcact tcatccgcta gctcccatta 360 
aggagaggtc cgactatatc ttatgagaag aatacatctt ccccgaccga catttagtct 420 
gtgaactgea tccatatcaa cagaagatct gttgacttag gacttggctg cgtggttagg 480 
accatgtccg tcagcttatt accataccct ttgeaaggee atctcaaagt ttcccctgag 540 
acttggtact gaacggctgt tgaccccttt cccgcaattt gaeggtcttg ccttttaaac 600 
egatgeaate tattggttta aagactagct accaggtgea tgcctgataa cttttggtcc 660 
caaaggtaaa tattgetatt taggacegtt aggatccact agttctagag cggccgccac 720 
cgcggtggag ctccagcttt tgttcccttt agtgagggtt aattgegege ttggcgtaat 780 
catggtcat 789 

<210> 10 
<211> 2566 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: LLLtrB 
intron from vector pCB345-34. 

<400> 10 

gaattcgccc tttgaagact tgacatttga tatggtgaag tagggaggta ccgccttgtt 60 
cacattactg tgactggttt gcaccaccct ettegggaac cgtacgtacct:ctctcggag 120 
tataeggetc tgttattgtt cgttcgtaaa aattcacttg tgtttatgaa tcacgtgacg 180 
atgacaatga aagcatacaa caagagtttt acgttgtttc gctatcattg ccatttccca 240 
tttttctttg cctttaagat ttttgacctt attgacatgg tgaatttcat aggaagtatt 300 
ttcatcagat gttccacata attcacaaca tttagctttt aacctgttttcaagagtatt 360 
cegggcatag ccatacaata caggagcttg acttatctta teegtaaatt gataagggga 420 
tttacattca ctaaaatttg caaaataacg gegctgetta ecttgettta tetcataegg 480 
gatgccccac gaaccacttc catctttaaa catggaaatg gtttttgaaa gtgttccctt 540 
atgtttggag gctatcgttt ttagacagct gtattccata agataagcaa aataattgag 600 
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ctggttaaaa ttacttgcta gaccgtagta attacaaatc cctcttaatt cagaattata 660 
aattgtgatg atttctaagt ctgttgaacg aataagatat ttcctgtgaa ctggaaacca 720 
tgagctatct ttcttttgga tagctatttt cttgtcaaaa ataaattgac gaattttgtc 780 
ttgaagagga ataaggagtt ctacactccc attgagtgtt ctctttttga ctttaccaga 840 
tcgttttatc gttccacttc tccttactcg tatatcatat cccagaaaac gagcgggttg 900 
actgctatgt gtgatgagtg ttttttcttc actcaattcc atttttagct tgttatgaat 960 
aaaaagtttt aattgltctt ttatccattg acagtcctct ttgcttcctt taacagagat 1020 
aatgaagtcg tccgcatacc ggacgtattt caatacttta tttgtctgtg aggtacaggg 1080 
gagtgtgggt aatcttttac gtttttcttg atattctaaa agaactttag ctttttcttc 1140 
accctccaac ttcttgagac ggtgagaaat tctttttatc tcattgtgaa gttcccgata 1200 
ttcaggtgtt attctttctg gactttctcg gtcaaacttc attttgagtt gtaaaacaaa 1260 
cttatccaat tcatgaagat agatgttggc caaaagagga gatagaattc caccttgagg 1320 
tgttccgctg taagttttgt gatactgcca gttttccaga taacctgctt ttagaaattt 1380 
ataaatcaat tggctcattt tcatatcttt gattttaaga ttgatgagtc caatgagtgt 1440 
aacgtggtct atattatcga agcagccttt tatatctccc tccacaaacc atcttgcgcc 1500 
gccaaactct cttttgattg ttttcaaagc cgtgtgacag cttcgttgag gtctaaaacc 1560 
gtgagacaca tcttcgaata ccggttcata gatagattca agaattattc tcacagcttc 1620 
ttggatcaat ttatctgtga aagttggaat tcctaaaggt ctcatctttt tagaattctt 1680 
ttttgcaata tacattcttc gtacaggttg aggatagtaa gttccgtctt ttaaagattg 1740 
aataatcttt tttatttttt cttcactaaa gccatccgct gtatcatcta atattccttt 1800 
tgtggaagct cctttattgg aatataaatt ttgatacgcc acgtaataaa tatctggacg 1860 
taaaagataa cgataaagtc ttgtaaaaac ttcgtctata ttttcttgtg aatttttact 1920 
gattctttct aaaattgcca ttgttggttt cattttgagg ttttcctccc taatcaattt 1980 
ttaattttag tacacaataa ctgtacccct ttgccatgta aagggcgtta cccttctcag 2040 
actactacga gtactccgta cccttgcaag attttcaagc tctagtgcta tagccttttt 2100 
cctcctttct attaggcatt cttgtttagg gtatccccag ttagtgttaa gtcttggtaa 2160 
attcagattc tcggcatcgc tttcgtttcg ttcccatagg ttctcctaca gattgtacaa 2220 
atgtggtgat aacagataag tccacaacca taacttacct ttctttgtac tagaggtttc 2280 
agacactttc ctctatcgac acataaccga aattagaaac ttgckttcag taaacacaac 2340 
ttatacctta tatctgatta acattgcgac tcagtcgtac ccgattgtct ttaggtaact 2400 
catcgctttc caaccgtgct ctgttcccgt atcagctttc gcttttcggt taggttggct 2460 
gttttctgtg ttatcttaca gagtagtacc ttaaactact tgacttaaca ccctatctgg 2520 
gcgcacgtta tggatgtgtt aggaatgtct tctcaagggc gaattc 2566 

<210> 11 

<211>520 

<212>DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: synthetic 
sequence coding for I-Ppo I homing-endonuclease 

<220> 
<221> CDS 
<222> (16)..(504) 

<400> 11 

gcatatgaga tctcc atg gcg etc acc aat get caa ate ttg get gtg att 51 
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Met Ala Leu Thr Asn Ala Gin lie Leu Ala Val lie 
1 5 10 

gac agt tgg gaa gaa aca gtc ggt cag ttt cca gtg ata acg cac cat 99 
Asp Ser Trp GIu Glu Thr Val Gly Gin Phe Pro Val He Thr His His 
15 20 25 

gta cca tta ggt ggc ggt ctg caa gga acg etc cat tgt tac gag ate 147 
Val Pro Leu Gly Gly Gly Leu Gin Gly Thr Leu His Cys Tyr Glu He 
30 35 40 

ccc eta gca get cct tat ggg gtt ggc ttt get aag aat ggg cct acc 195 
Pro Leu Ala Ala Pro Tyr Gly Val Gly Phe Ala Lys Asn Gly Pro Thr 
45 50 55 60 

cgc tgg caa tac aaa egg aca ate aat caa gtc gtc cac aga tgg ggg 243 
Arg Trp Gin Tyr Lys Arg Thr lie Asn Gin Val Val His Aig Trp Gly 
65 70 75 

tec cac aca gtc cct ttt eta tta gaa ccg gat aac ate aac ggc aaa 291 
Ser His Thr Val Pro Phe Leu Leu Glu Pro Asp Asn He Asn Gly Lys 
80 85 90 

acc tgc aca gca teg cac eta tgt cat aat act cga tgc cac aat ccc 339 
Thr Cys Thr Ala Ser His Leu Cys His Asn Thr Arg Cys His Asn Pro 
95 100 105 

ttg cac ttg tgc tgg gag tea eta gac gac aac aaa ggc aga aac tgg 387 
Leu His Leu Cys Trp Glu Ser Leu Asp Asp Asn Lys Gly Arg Asn Trp 
110 115 120 

tgc ccc ggt ccc aac ggg gga tgt gtc cat gcg gtg gtt tgt tta agg 435 
Cys Pro Gly Pro Asn Gly Gly Cys Val His Ala Val Val Cys Leu Arg 
125 130 135 140 

cag ggt ccg ttg tac ggc cca ggg gcg act gtg gca ggt cct caa caa 483 
Gin Gly Pro Leu Tyr Gly Pro Gly Ala Thr Val Ala Gly Pro Gin Gin 
145 150 155 

agg ggc agt cac ttt gtg gta taactgeage tcgagg 520 
Arg Gly Ser His Phe Val Val 
160 

<210> 12 
<211> 163 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: synthetic 
sequence coding for I-Ppo I homing-endonuclease 

<400> 12 

Met Ala Leu Thr Asn Ala Gin He Leu Ala Val He Asp Ser Trp Glu 
15 10 15 

Glu Thr Val Gly Gin Phe Pro Val He Thr His His Val Pro Leu Gly 
20 25 30 
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Gly Gly Leu Gin Gly Thr Leu His Cys Tyr Glu lie Pro Leu Ala Ala 
35 40 45 

Pro Tyr Gly Val Gly Phe Ala Lys Asn Gly Pro Thr Arg Trp Gin Tyr 
50 55 60 

Lys Arg Thr Ee Asn Gin Val Val His Arg Trp Gly Ser His Thr Val 
65 70 75 80 

Pro Phe Leu Leu Glu Pro Asp Asn lie Asn Gly Lys Thr Cys Thr Ala 
85 90 95 

Ser His Leu Cys His Asn Thr Arg Cys His Asn Pro Leu His Leu Cys 
100 105 110 

Trp Glu Ser Leu Asp Asp Asn Lys Gly Arg Asn Trp Cys Pro Gly Pro 
115 120 125 

Asn Gly Gly Cys Val His Ala Val Val Cys Leu Arg Gin Gly Pro Leu 
130 135 140 

Tyr Gly Pro Gly Ala Thr Val Ala Gly Pro Gin Gin Arg Gly Ser His 
145 150 155 160 

Phe Val Val 



<210> 13 
<211> 481 
<212> DNA 

<213> Chlamydomonas pallidostigmatica 
<220> 
<221> CDS 
<222> (4)..(459) 

<223> ORF coding for I-Cpa I homing endonuclease 
<400>13 

acc atg gac att aat cct caa tgg att aca ggt ttc gta gat ggg gaa 48 
Met Asp De Asn Pro Gin Tip lie Thr Gly Phe Val Asp Gly Glu 
15 10 15 

ggt tgt ttt agt gta agt ata ctt aga aat aat teg ttg cgc tat ggc 96 
Gly Cys Phe Ser Val Ser He Leu Arg Asn Asn Ser Leu Arg Tyr Gly 

20 25 30 

cat cag ctt caa cca gaa ttc gta gtg acc caa cat aaa tta gat gca 144 
His Gin Leu Gin Pro Glu Phe Val Val Thr Gin His Lys Leu Asp Ala 
35 40 45 

aat gtt tta tat gca tta aaa gac tac ttt aaa gtt gga tea gtc gtt 192 
Asn Val Leu Tyr Ala Leu Lys Asp Tyr Phe Lys Val Gly Ser Val Val 
50 55 60 

gtg aat cat ggg gaa egg ctt tgc tat aaa gtc aaa aat att gat cac 240 
Val Asn His Gly Glu Arg Leu Cys Tyr Lys Val Lys Asn He Asp His 
65 70 75 
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ttt ata acc gtc att ata cca ttt ttc gaa aaa cat gag eta aaa aca 288 
Phe He Thr Val He He Pro Phe Phe Glu Lys His Glu Leu Lys Thr 
80 85 90 95 

aaa aga aga att gaa ttt ctt cga ttt cga aaa ate tgc ttg ctg tta 336 
Lys Arg Arg He Glu Phe Leu Arg Phe Arg Lys He Cys Leu Leu Leu 
100 105 110 

aaa gca ggt aga cat tta gaa teg cag gaa gga ttc gag aaa gtg ttg 384 
Lys Ala Gly Arg His Leu Glu Ser Glu Glu Gly Phe Glu Lys Val Leu 
115 120 125 

gat tta gca aaa aaa etc cgt ate aat gag aaa aac tac cag gaa tct 432 
Asp Leu Ala Lys Lys Leu Arg He Asn Glu Lys Asn Tyr Gin Glu Ser 
130 : 135 140 

ate aaa cgt ttt gaa gaa act ggc gag taactcgaga gtatagagct cc 481 
He Lys Arg Phe Glu Glu Thr Gly Glu 
145 150 

<210> 14 
<211> 152 
<212> PRT 

<213> Chlamydoxnonas pallidosu'gmatica 
<400> 14 

Met Asp He Asn Pro Gin Trp He Thr Gly Phe Val Asp Gly Glu Gly 
1 5 10 15 

Cys Phe Ser Val Ser He Leu Arg Asn Asn Ser Leu Arg Tyr Gly His 
20 25 30 

Gin Leu Gin Pro Glu Phe Val Vhl Thr Gin His Lys Leu Asp Ala Asn 
35 40 45 

Val Leu Tyr Ala Leu Lys Asp Tyr Phe Lys Val Gly Ser Val Val Val 
50 55 60 

Asn His Gly Glu Arg Leu Cys Tyr Lys Val Lys Asn He Asp His Phe 
65 70 75 80 

He Thr Val ne lie Pro Phe Phe Glu Lys His Glu Leu Lys Thr Lys 
85 90 95 

Arg Arg He Glu Phe Leu Arg Phe Arg Lys He Cys Leu Leu Leu Lys 
100 105 110 

Ala Gly Arg His Leu Glu Ser Gin Glu Gly Phe Glu Lys Val Leu Asp 
115 120 125 

Leu Ala Lys Lys Leu Arg He Asn Glu Lys Asn Tyr Gin Glu Ser He 
130 135 140 

Lys Arg Phe Glu Glu Thr Gly Glu 
145 150 



CA 02470329 2004-06-14 



0817/00028 

13 



<210> 15 
<211> 917 
<212> DNA 

<213> Chlamydomonas pallidostigmatica 
<220> 

<221> intron 
<222> (18)..(893) 
<223> CPLSU2 Intron 

<400> 15 

gaagacgatc ctaaggttaa ttgctagctt tagtaaaact gaactatatg ctggaaaatc 60 
ctcgggcctc ttctttcccc agttgcactt cgtgctaaag cgggcagatg aaactataaa 120 
gcaaaaagag gctcatcact tttttttacc ggaactccgt tccggcttaa agtgatgaaa 180 
aagatgtctt gtacgtctcg tattttgcca cgagccgtga taatcaagcg tacatgggga 240 
caatcagcag ggaaggatgt ttgatacacg cagagtagaa ctggcagtct agaatcggtc 300 
ggtatcgcta aacataaccc tcagagacta tacgttcgga agtcttaata atcataataa 360 
ta ggaggaaa ttgaaaatgg acattaatcc tcaatggatt acaggtttcg tagatgggga 420 
aggttgtttt agtgtaagta tacttagaaa taattcgttg cgctatggcc atcagcttca 480 
accagaattc gtagtgaccc aacataaatt agatgcaaat gttttatatg cattaaaaga 540 
ctactttaaa gttggatcag tcgttgtgaa tcatggggaa cggctttgct ataaagtcaa 600 
aaatattgat cactttataa ccgtcattat accatttttc gaaaaacatg agctaaaaac 660 
aaaaagaaga attgaatttc ttcgatttcg aaaaatctgc ttgctgttaa aagcaggtag 720 
acatttagaa tcgcaggaag gattcgagaa agtgttggat ttagcaaaaa aactccgtat 780 
caatgagaaa aactaccagg aatctatcaa acgttttgaa gaaactggcg agtaaaaaaa 840 
ataagattta agatagagtc cagcctattt tgaagaaaaa tggggtaaat ctgagcgaaa 900 
ttccttgtca agtcttc 917 

<210> 16 
<2U> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 16 

taaggccctc ggtagcaacg g 21 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 



<400> 17 

ggggtaccaa atccaactag 



20 
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<210> 18 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 18 

ggagctcgct cccccgccgt cgttc 25 

<210> 19 
<2U> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>19 

gatgcatgat gacttgacgg catcctc 27 

<210> 20 
<211> 16 
<212>DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>20 

gtcgacagat ctttaa 16 

<210> 21 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>21 

agatctgtcg acttaa 16 

<210> 22 
<211> 22 
<212>DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 
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<400> 22 

gatctccagt taactggggt ac 22 

<210>23 
<211> 14 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>23 

cccagttaac tgga 14 

<210> 24 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>24 

ttaagccagt taactgggcg gagct 25 

<210> 25 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>25 

ccgcccagtt aactggc 17 

<210> 26 
<211> 34 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 26 

tcgagaagat cagcctgtta tccctagagt aact 34 

<210>27 
<211> 34 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>27 

ctagagttac tctagggata acaggctgat cttc 34 

<210> 28 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 28 

agaagacgat cctaagg 17 

<210> 29 
<2U> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>29 

tgaagacttg acaaggaatt teg 23 

<210> 30 
<211> 56 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 30 

agaagacgat cctaaatagc aatatttacc tttgggacca aaagttatca ggcatg 56 

<210> 31 
<211> 50 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 31 

tgaagacttg acaaggaatt tcgctacctt cgagtactcc aaaactaatc 50 

<210> 32 
<211> 33 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 32 

gagaagacat tcctaacaca tccataacgt gcg 33 

<210> 33 
<211> 32 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>33 

tgaagacttg acatttgata tggtgaagta gg 32 

<210> 34 
<211> 175 
<212> DNA 
<213> Pisum sativum 

<220> 
<221> CDS 
<222> (1)..(174) 

<223> plastid translocaUzation sequence 
<400> 34 

atg get tct atg ata tec tct tea get gtg act aca gtc age cgt get 48 
Met Ala Ser Met He Ser Ser Ser Ala Val Thr Thr Val Ser Arg Ala 
15 10 15 

tct acg gtg caa teg gec gcg gtg get cca ttc ggc ggc etc aaa tec 96 
Ser Thr Val Gin Ser Ala Ala Val Ala Pro Phe Gly Gly Leu Lys Ser 
20 25 30 

atg act gga ttc cca gtt aag aag gtc aac act gac att act tec art 144 
Met Thr Gly Phe Pro Val Lys Lys Val Asn Thr Asp He Thr Ser He 
35 40 45 

aca age aat ggt gga aga gta aag tgc atg c 175 
Thr Ser Asn Gly Gly Arg Val Lys Cys Met 
50 55 

<210> 35 

<211> 58 

<212> PRT 

<213> Pisum sativum 
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<400>35 

Met Ala Ser Met lie Ser Ser Ser Ala Val Thr Thr Val Ser Arg Ala 
15 10 IS 

Ser Thr Val Gin Ser Ala Ala Val Ala Pro Phe Gly Gly Leu Lys Ser 
20 25 30 

Met Thr Gly Phe Pro Val Lys Lys Val Asn Thr Asp He Thr Ser lie 
35 40 45 

Thr Ser Asn Gly Gly Arg Val Lys Cys Met 
50 55 



<210> 36 
<211> 84 
<212> PRT 

<213> Nicotiana tabacum 
<220> 

<221> TRANSIT 
<222> (1).<84) 

<223> plastidic translocalisation signal derived from 
plastidic transketolase 

<400> 36 

Met Ala Ser Ser Ser Ser Leu Thr Leu Ser Gin Ala lie Leu Ser Arg 
1 5 10 15 

Ser Val Pro Arg His Gly Ser Ala Ser Ser Ser Gin Leu Ser Pro Ser 
20 ~ 25 30 

Ser Leu Thr Phe Ser Gly Leu Lys Ser Asn Pro Asn lie Thr Thr Ser 
35 40 ' 45 

Arg Arg Arg Thr Pro Ser Ser Ala Ala Ala Ala Ala Val Val Arg Ser 
50 55 60 

Pro Ala lie Arg Ala Ser Ala Ala Thr Glu Thr lie Glu Lys Thr Glu 
65 70 75 80 

Thr Ala Gly Ser 

<210> 37 
<211> 258 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> transit_pepdde 
<222> (1)..(258) 

<223> plastidic translocalisation signal derived from 
plastidic transketolase (frame 1) 
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<400> 37 

ggtaccatgg cgtcttcttc ttctctcact ctctctcaag ctatcctctc tcgttctgtc 60 
cctcgccatg gctctgcctc ttcttctcaa ctttcccctt cttctctcac tttttccggc 120 
cttaaatcca atcccaatat caccacctcc cgccgccgta ctccttcctc cgccgccgcc 180 
gccgccgtcg taaggtcaoc ggcgattcgt gcctcagctg caaccgaaac catagagaaa 240 
actgagactg cgggatcc 258 

<210> 38 
<211> 260 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> transit jpeptide 
<222> (1)..(260) 

<223> plastidic translocalisation signal derived from 

plastidic transketolase (frame 2) 
<400> 38 

ggtaccatgg cgtcttcttc ttctctcact ctctctcaag ctatcctctc tcgttctgtc 60 
cctcgccatg gctctgcctc ttcttctcaa ctttcccctt cttctctcac tttttccggc 120 
cttaaatcca atcccaatat caccacctcc cgccgccgta ctccttcctc cgccgccgcc 180 
gccgccgtcg taaggtcacc ggcgattcgt gcctcagctg caaccgaaac catagagaaa 240 
actgagactg cgctggatcc 260 

<210> 39 
<211> 259 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> transit_peptide 
<222> (1)..(259) 

<223> plastidic translocalisation signal derived from 
plastidic transketolase (frame 3) 

<400>39 

ggtaccatgg cgtcttcttc ttctctcact ctctctcaag ctatcctctc tcgttctgtc 60 
cctcgccatg gctctgcctc ttcttctcaa ctttcccctt cttctctcac tttttccggc 120 
cttaaatcca atcccaatat caccacctcc cgccgccgta ctccttcctc cgccgccgcc 180 
gccgccgtcg taaggtcacc ggcgattcgt gcctcagctg caaccgaaac catagagaaa 240 
actgagactg cggggatcc 259 

<210>40 
<211> 63 
<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<221> TRANSIT 
<222> (1)..(63) 

<223> plastidic translocalisation signal derived from 
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plastidic isopentenylpyrophosphate isomerase-2 
(IPP-2) 

<400>40 

Met Ser Ala Ser Ser Leu Phe Asn Leu Pro Leu lie Arg Leu Arg Ser 
1 5 10 15 

Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro 
20 25 30 

Leu Ser Ser lie Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser 
35 40 45 

Gly Thr Ala Met Thr Asp Thr Lys Asp Gly Ser Arg VaJ Asp Met 
50 55 60 

<210> 41 
<211> 205 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> transit_peptide 
<222> (1)..(205) 

<223> plastidic translocalisation signal derived from 
plastidic isopentenylpyrophosphate isomerase-2 
(IPP-2) (frame 1) 

<400> 41 

gatatccaca ccaacaccaa tgtctgcttc ttctttattt aatctcccat tgattcgcct 60 
cagatctctc gctctttcgt cttctttttc ttctttccga tttgcccatc gtcctctgtc 120 
atcgatttca ccgagaaagt taccgaattt tcgtgctttc tctggtaccg ctatgacaga 180 
tactaaagat ggatcccggg tcgac 205 

<210> 42 
<211> 207 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> transit_peptide 
<222> (1)..(207) 

<223> plastidic translocalisation signal derived from 
plastidic isopentenylpyrophosphate isomerase-2 
(IPP-2) (frame 2) 

<400> 42 

gatatccaca ccaacaccaa tgtctgcttc ttctttattt aatctcccat tgattcgcct 60 
cagatctctc gctctttcgt cttctttttc ttctttccga tttgcccatc gtcctctgtc 120 
atcgatttca ccgagaaagt taccgaattt tcgtgctttc tctggtaccg ctatgacaga 180 
tactaaagat ctggatcccg ggtcgac 207 

<210> 43 
<211> 206 
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<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> transit_peptide 
<222> (1)..(206) 

<223> plastidic translocalisation signal derived from 
plastidic isopentenylpyrophosphate isomerase-2 
(IPP-2) (frame 3) 

<400>43 

gatatccaca ccaacaccaa tgtctgcttc ttctttattt aatctcccat tgattcgcct 60 
cagatctctc gctctttcgt cttctttttc ttctttccga tttgcccatc gtcctctgtc 120 
atcgatttca ccgagaaagt taccgaattt tcgtgctttc tctggtaccg ctatgacaga 180 
tactaaagat gggatcccgg gtcgac " 206 

<210> 44 
<211> 44 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(44) 

<223> plastidic promoter PrbcL 
<400>44 

gttgcgctat atatatgaaa gagtatacaa taatgatgta tttg 44 

<210> 45 
<211> 64 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(64) 

<223> plastidic promoter Prpsl6-107 
<400>45 

tagcgatggg gtcttactaa agaaaaatat ttatccacct atctctatag tatatagata 60 
taga 64 

<210> 46 
<211> 91 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(91) 

<223> plastidic promoter Prml6 
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<400>46 

cgccgtcgtt caatgagaat ggataagagg ctcgtgggat tgacgtgagg gggcagggat 60 
ggctatattt ctgggagcga actccgggcg a 91 

<210> 47 
<211> 71 
<212> DNA 

<213> Nicotiana tabaoim 
<220> 

<221> promoter 
<222> (1)..(71) 

<223> plastidic promoter PaccD-129 
<400> 47 

gtcgacatat tattttaaat aatataaagg gggttccaac atattaatat atagtgaagt 60 
gttccggatcc 71 

<210> 48 
<211> 27 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(27) 

<223> plastidic promoter PclpP-53 
<400>48 

agacaataaa aaaaattgtt acgtttc 27 

<210> 49 
<211> 65 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(65) 

<223> plastidic promoter Prrn-62 
<400> 49 

gagcgaactc cgggcgaata tgaagcgcat ggatacaagt tatgccttgg aatgaaagac 60 
aattc 65 

<210> 50 
<211> 107 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(107) 
<223> plastidic promoter Prpsl6 
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<400>50 

tctatatata tttatatatt aaatattaat taatatgtta gataaattaa tattagcgat 60 
ggggtcttactaaagaaaaatatttatccacctatctctatagtata 107 
<210> 51 
<211> 47 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(47) 

<223> plastidic promoter PatpB/E-290 
<400> 51 

agaaatagaa aataaagttc aggttcgaat tccatagaat agataat 47 

<210> 52 
<211> 25 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> promoter 
<222> (1)..(25) 

<223> plastidic promoter PrpoB-345 
<400>52 

aatgtgtatt atcataataa tggta 25 

<210> 53 
<211> 35 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: consensus 
sequence of Rcoli sigma70 promoter 

<400> 53 

ttgacattca ctcttcaatt atctataatg ataca 35 

<210> 54 
<211> 80 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> misc feature 
<222> (1)..(80) 

<223> 5'-untranslated region of psbA gene (incl. Ncol 
site) 
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<400> 54 

tccattttct attttgattt gtagaaaact agtgtgcttg ggagtccctg atgattaaat 60 
aaaccaagat tttaccatgg 80 

<210>55 
<211> 40 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> misc_feature 
<222>(1)..(40) 

<223> 5'-untranslated region of rbcL gene 
<400>55 

agttgtaggg agggattcat gacaccacaa acagccatgg 40 

<210>56 
<211> 22 
<212> DNA 

<213> Nicotiana tabacum 
<220> 

<221> misc feature 
<222> (1)..(22) 

<223> 5 '-untranslated region of rbcLs gene (modified) 
<400> 56 

agttgtaggg agggattcat ga 22 

<210> 57 

<211> 127 

<212>DNA 

<213> Synechocystis sp. 

<220> 

<221>misc feature 
<222> (1)..(127) 

<223> 3 '-untranslated region of synechcystis psbA-1 gene 
<400> 57 

tgccattgcc ataactgctt tcggttagac ttcgtttcat ttggttaatc aagggcactc 60 
tcgcaatggg gtgcctttta tggtccaagg ttaaagttaa gccagtacta tttctagggt 120 
gaaatgt 127 

<210> 58 
<211> 90 
<212>DNA 

<213> Nicotiana tabacum 
<220> 

<221> misc feature 
<222> (1)..(90) 

<223> 3'-untranslated region of tobacco psbA gene 
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<400> 58 

cctggcctag tctataggag gttttgaaaa gaaaggagca ataatcattt tcttgttcta 60 
tcaagagggt gctattgctc ctttcttttt 90 

<210> 59 
<211> 136 
<212>DNA 

<213> Nicotiana tabacum 
<220> 

<221> misc_feature 
<222> (1)..(136) 

<223> 3 '-untranslated region of tobacco rbcL gene 
<400>59 

agtagacatt agcagataaa ttagcaggaa ataaagaagg ataaggagaa agaactcaag 60 
taattatcct tcgttctctt aattgaattg caattaaact cggcccaatc ttttactaaa 120 
aggattgagc cgaata 136 

<210> 60 
<211> 13 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: synthetic 
ribosome binding site (RBS) 

<400>60 

ggaggnnnnn atg 13 

<210> 61 
<211> 13 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: synthetic 
ribosome binding site (RBS) 

<400> 61 

ggaggatctc atg 13 

<210> 62 
<211> 1494 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: complete 
insert of vector pCB304-25 

<400>62 

ggtaccgggc cgataattcc tggcctagtc tataggaggt tttgaaaaga aaggagcaat 60 
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aatcattttc ttgttctatc aagagggtgc tattgctoct ttcttttttt ctcgagtacg 120 
ctagggataa cagggtaata tagaagcttg actctcttaa agatctcgac tctatatata 180 
tttatatatt aaatattaat taatatgtta gataaattaa tattagcgat ggggtcttac 240 
taaagaaaaa tatttatcca cctatctcta tagtatatag gatctagttg tagggaggga 300 
ttcatgacac cacaaacagc catgggagct tggattgaac aagatggatt gcacgcaggt 360 
tctccggccg cttgggtgga gaggctattc ggctatgact gggcacaaca gacaatcggc 420 
tgctctgatg ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct ttttgtcaag 480 
accgacctgt ccggtgccct gaatgaactg caggacgagg cagcgcggct atcgtggctg 540 
gccacgacgg gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc gggaagggac 600 
tggctgctat tgggcgaagt gccggggcag gatctcctgt catctcacct tgctcctgcc 660 
gagaaagtat ccatcatggc tgatgcaatg cggcggctgc atacgcttga tccggctacc 720 
tgcccattcg accaccaagc gaaacatcgc atcgagcgag cacgtactcg gatggaagcc 780 
ggtcttgtcg atcaggatga tctggacgaa gagcatcagg ggctcgcgcc agccgaactg 840 
ttcgccaggc tcaaggcgcg catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat 900 
gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt ctggattcat cgactgtggc 960 
cggctgggtg tggcggaccg ctatcaggac atagcgttgg ctacccgtga tattgctgaa 1020 
gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat 1080 
tcgcagcgca tcgccttcta tcgccttctt gacgagttct tctgagcggg acccaagcta 1140 
ggcggccgcg gatccaagct tgaattcagt agacattagc agataaatta gcaggaaata 1200 
aagaaggata aggagaaaga actcaagtaa ttatccttcg ttctcttaat tgaattgcaa 1260 
ttaaactcgg cccaatcitt tactaaaagg attgagccga atactcgact taaggtagcg 1320 
tcgagtacgc gcccggggag cccaagggca cgccctggca cccgaagctc tagtatcaaa 1380 
tttggcacaa aaagcaaaat taaaatactg ataattgcca acacaattaa catctcaatc 1440 
aaggtaaatg ctttttgctt tttttgccaa agctatcttc cgtgatcaga gctc 1494 

<210> 63 
<211> 592 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: BgUI / Muni 
fragment from vector pCB320- 192 

<400> 63 

agatcttcca ttttctattt tgatttgtag aaaactagtg tgcttgggag tccctgatga 60 
ttaaataaac caagatttta ccatggcgct caccaatgct caaatcttgg ctgtgattga 120 
cagttgggaa gaaacagtcg gtcagtttcc agtgataacg caccatgtac cattaggtgg 180 
cggtctgcaa ggaacgctcc attgttacga gatcccccta gcagctcctt atggggttgg 240 
ctttgctaag aatgggccta cccgctggca atacaaacgg acaatcaatc aagtcgtcca 300 
cagatggggg tcccacacag tcccttttct attagaaccg gataacatca acggcaaaac 360 
ctgcacagca tcgcacctat gtcataatac tcgatgccac aatcccttgc acttgtgctg 420 
ggagtcacta gacgacaaca aaggcagaaa ctggtgcccc ggtcccaacg ggggatgtgt 480 
ccatgcggtg gtttgtttaa ggcagggtcc gttgtacggc ccaggggcga ctgtggcagg 540 
tcctcaacaa aggggcagtc actttgtggt ataactgcag aagcttcaat tg 592 

<210> 64 
<211> 25 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 64 

aaagatctcc tcacaaaggg ggtcg 25 

<210> 65 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 65 

tcgaagactt aggaccgtta tag 23 
<210> 66 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 

oligonucleotide primer 
<400> 66 

aggaagacct tgtcgggtaa gttccg 26 

<210> 67 
<211> 31 
<212>DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400>67 

ctcaattggg gtctctctgt ccaggtgcag g 31 

<210>68 
<211> 672 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: artificial 
fusion protein of I-Ppo I homing endonuclease and 
plastidic transit peptide 
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<220> 
<221> CDS 
<222> (1)..(669) 

<400>68 

atg tct get tct tct tta ttt aat etc cca ttg att cgc etc aga tct 48 

Met Ser Ala Ser Ser Leu Phe Asn Leu Pro Leu lie Arg Leu Arg Ser 

1 5 10 15 

etc get ctt teg tct tct ttt tct tct ttc cga ttt gec cat cgt cct 96 
Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro 
20 25 30 

ctg tea teg att tea ccg aga aag tta ccg aat ttt cgt get ttc tct 144 
Leu Ser Ser lie Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser 
35 40 45 

ggt ace get atg aca gat act aaa gat ggg ate ccc atg gcg etc ace 192 
Gly Thr Ala Met Thr Asp Thr Lys Asp Gly De Pro Met Ala Leu Thr 
50 55 60 

aat get caa ate ttg get gtg att gac agt tgg gaa gaa aca gtc ggt 240 
Asn Ala Gin De Leu Ala Val lie Asp Ser Trp Glu Glu Thr Val Gly 
65 70 75 * 80 

cag ttt cca gtg ata acg cac cat gta cca tta ggt ggc ggt ctg caa 288 
Gin Phe Pro Val lie Thr His His Val Pro Leu Gly Gly Gly Leu Gin 
85 90 95 

gga acg etc cat tgt tac gag ate ccc eta gca get cct tat ggg gtt 336 
Gly Thr Leu His Cys Tyr Glu lie Pro Leu Ala Ala Pro Tyr Gly Val 
100 105 110 

ggc ttt get aag aat ggg cct ace cgc tgg caa tac aaa egg aca ate 384 
Gly Phe Ala Lys Asn Gly Pro Thr Arg Trp Gin Tyr Lys Arg Thr lie 
115 120 125 

aat caa gtc gtc cac aga tgg ggg tec cac aca gtc cct ttt eta tta 432 
Asn Gin Val Val His Arg Trp Gly Ser His Thr Val Pro Phe Leu Leu 
130 135 140 

gaa ccg gat aac ate aac ggc aaa ace tgc aca gca teg cac eta tgt 480 
Glu Pro Asp Asn lie Asn Gly Lys Thr Cys Thr Ala Ser His Leu Cys 
145 150 155 160 

cat aat act cga tgc cac aat ccc ttg cac ttg tgc tgg gag tea eta 528 
His Asn Thr Arg Cys His Asn Pro Leu His Leu Cys Trp Glu Ser Leu 
165 170 175 

gac gac aac aaa ggc aga aac tgg tgc ccc ggt ccc aac ggg gga tgt 576 
Asp Asp Asn Lys Gly Arg Asn Trp Cys Pro Gly Pro Asn Gly Gly Cys 
180 185 190 

gtc cat gcg gtg gtt tgt tta agg cag ggt ccg ttg tac ggc cca ggg 624 
Val His Ala Val Val Cys Leu Arg Gin Gly Pro Leu Tyr Gly Pro Gly 
195 200 205 
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K 8 S? ?, 8 i g ^ 6 £ 038 CM agg ggC agt cac ttt 8* toa 67 2 
Ala Thr Val Ala Gly Pro Gin Gin Arg Gly Ser His Phe Val Val 

210 215 220 

<210> 69 
<211> 223 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: artificial 
fusion protein of I-Ppo I homing endonuclease and 
plastidic transit peptide 

<400>69 

Met Ser Ala Ser Ser Leu Phe Asn Leu Pro Leu lie Are Leu Are Ser 
1 5 10 15 6 6 

Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro 
20 25 30 

Leu Ser Ser He Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser 
35 40 45 

Gly Thr Ala Met Thr Asp Thr Lys Asp Gly He Pro Met Ala Leu Thr 
50 55 60 

Asn Ala Gin lie Leu Ala Val He Asp Ser Trp Glu Glu Thr Val Glv 
65 70 75 80 

Gin Phe Pro Val He Thr His His Val Pro Leu Gly Gly Gly Leu Gin 
85 90 95 

Gly Thr Leu His Cys Tyr Glu He Pro Leu Ala Ala Pro Tyr Glv Val 
100 105 HO 

Gly Phe Ala Lys Asn Gly Pro Thr Arg Trp Gin Tyr Lys Arg Thr He 

Asn Gin Val Val His Arg Trp Gly Ser His Thr Val Pro Phe Leu Leu 
130 135 140 

Glu Pro Asp Asn He Asn Gly Lys Thr Cys Thr Ala Ser His Leu Cvs 
I 45 150 155 160 

His Asn Thr Arg Cys His Asn Pro Leu His Leu Cys Trp Glu Ser Leu 
165 170 175 

Asp Asp Asn Lys Gly Arg Asn Trp Cys Pro Gly Pro Asn Gly Gly Cys 
180 185 190 

Val His Ala Val Val Cys Leu Arg Gin Gly Pro Leu Tyr Gly Pro Glv 
195 200 205 » 

Ala Thr Val Ala Gly Pro Gin Gin Arg Gly Ser His Phe Val Val 
210 215 220 
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<210> 70 
<211> 558 
<212> DNA 

<213> Pbysarum polycephalum 

<220> 
<221> CDS 
<222> (1)..(555) 

<223> coding for long version of I-Ppol 
<400>70 

atg gcg aaa tec aac caa get egg gta aac ggc ggg agt aac tat gac 48 
Met Ala Lys Ser Asn Gin Ala Arg "Val Asn Gly Gly Ser Asn Tyr Asp 
15 10 " 15 

tct etc acc ccc tta aat atg gcg etc acc aat get caa ate ttg get 96 
Ser Leu Thr Pro Leu Asn Met Ala Leu Thr Asn Ala Gin lie Leu Ala 
20 25 30 

gtg att gac age tgg gaa gaa aca gtc ggt cag ttt cca gtg ata acg 144 
Val lie Asp Ser Trp Glu Glu Thr Val Gly Gin Phe Pro Val lie Thr 
35 40 45 

cac cat gta cca tta ggt ggc ggt ctg caa gga acg etc cat tgt tac 192 
His His Val Pro Leu Gly Gly Gly Leu Gin Gly Thr Leu His Cys Tyr 
50 55 60 

gag ate ccc eta gca get cct tat ggg gtt ggc ttt get aag aat ggg 240 
Glu He Pro Leu Ala Ala Pro Tyr Gly Val Gly Phe Ala Lys Asn Gly 
65 70 75 80 

cct acc cgc tgg caa tac aaa egg aca ate aat caa gtc gtc cac aga 288 
Pro Thr Arg Trp Gin Tyr Lys Arg Thr lie Asn Gin Val Val His Arg 
85 90 95 

tgg gga tec cac aca gtc cct ttt eta tta gaa ccg gat aac ate aac 336 
Trp Gly Ser His Thr Val Pro Phe Leu Leu Glu Pro Asp Asn He Asn 
100 105 110 

ggc aaa acc tgc aca gca teg cac eta tgt cat aat act cga tgc cac 384 
Gly Lys Thr Cys Thr Ala Ser His Leu Cys His Asn Thr Arg Cys His 
115 120 125 

aat ccc ttg cac ttg tgc tgg gag tea eta gac gac aac aaa ggc aga 432 
Asn Pro Leu His Leu Cys Trp Glu Ser Leu Asp Asp Asn Lys Gly Arg 
130 135 140 

aac tgg tgc ccg ggt ccc aac ggg gga tgt gtc cat gcg gtg gtt tgt 480 
Asn Trp Cys Pro Gly Pro Asn Gly Gly Cys Val His Ala Val Val Cys 
145 150 155 160 

tta agg cag ggt ccg ttg tac ggc ccg ggg gcg act gtg gca ggt cct 528 
Leu Arg Gin Gly Pro Leu Tyi Gly Pro Gly Ala Thr Val Ala Gly Pro 
165 170 175 
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caa caa agg ggc agt cac ttt gtg gta taa 558 
Gin Gin Arg Gly Ser His Phe Val Val 
180 185 

<210> 71 
<211> 185 
<212> PRT 

<213> Physarum polycephalum 
<400>71 

Met Ala Lys Ser Asn Gin Ala Arg Val Asn Gly Gly Ser Asn Tyr Asp 
15 10 15 

Ser Leu Thr Pro Leu Asn Met Ala Leu Thr Asn Ala Gin lie Leu Ala 
20 25 30 

Val lie Asp Ser Trp Glu Glu Thr Val Gly Gin Phe Pro Val He Thr 
35 40 45 

His His Val Pro Leu Gly Gly Gly Leu Gin Gly Thr Leu His Cys Tyr 
50 55 60 

Glu lie Pro Leu Ala Ala Pro Tyr Gly Val Gly Phe Ala Lys Asn Gly 
65 70 75 80 

Pro Thr Arg Trp Gin Tyr Lys Arg Thr He Asn Gin Val Val His Arg 
85 90 95 

Trp Gly Ser His Thr Val Pro Phe Leu Leu Glu Pro Asp Asn lie Asn 
100 105 110 

Gly Lys Thr Cys Thr Ala Ser His Leu Cys His Asn Thr Arg Cys His 
115 120 125 

Asn Pro Leu His Leu Cys Trp Glu Ser Leu Asp Asp Asn Lys Gly Are 
130 135 140 

Asn Trp Cys Pro Gly Pro Asn Gly Gly Cys Val His Ala Val Val Cys 
145 150 155 160 

Leu Arg Gin Gly Fro Leu Tyr Gly Pro Gly Ala Thr Val Ala Gly Pro 

165 170 175 

Gin Gin Arg Gly Ser His Phe Val Val 
180 185 



<210> 72 

<211> 34 

<212> DNA 

<213> Escherichia coli 

<220> 

<221> promoter 
<222> (1)..(34) 

<223> coding for promoter derived from consensus of 
sigma70 E.coli promoters 
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<400> 72 

ttgacaattt tcctctgaat tatataatta acat 34 

<210>73 
<211> 413 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: artificial 
intron TetIVS2a 

<400> 73 

aaatagcaat atttttcgtt gccttaaaaa gttatcaggc atgcacctgg tagctagtct 60 
ttaaaccaat agattgcatc ggtttaaaag gcaagaccgt caaattgcgg gaaaggggtc 120 
aacagccgtt cagtaccaag tctcagggga aactttgaga tggccttgca aagggtatgg 180 
taataagctg acggacatgg tcctaaccac gcagccaagt cctaagtcaa cagatcttct 240 
gttgatatgg atgcagttca cagactaaat gtcggtcggg gaagatgtat tcttctcata 300 
agatatagtc ggacctctcc ttaatgggag ctagcggatg aagtgatgca acactggagc 360 
cgctgggaac taatttgtat gcgaaagtat attgattagt tttggagtac teg 413 

<210> 74 
<211> 791 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
plasmidpCB459-l 

<400> 74 

ttacaatttc cattcgccat teaggctgeg caactgttgg gaagggegat eggtgeggge 60 
ctcttcgcta ttacgccagc tggegaaagg gggatgtgct geaaggegat taagttgggt 120 
aacgecaggg ttttcccagt cacgacgttg taaaacgacg gecagtgage gegegtaata 180 
cgactcacta tagggegaat tgggtacegg gccccccctc gaggtcgacg gtatcgataa 240 
gcttgacccg acaaggaatt tegctcgagt actccaaaac taatcaatat actttegcat 300 
acaaattagt tcccagcggc tccagtgttg catcacttca tccgctagct cccattaagg 360 
agaggtccga ctatatctta tgagaagaat acatcttccc cgaccgacat ttagtctgtg 420 
aactgcatcc atatcaacag aagatctgtt gacttaggac ttggctgcgt ggttaggacc 480 
atgtccgtca gcttattacc ataccctttg caaggccatc tcaaagtttc ccctgagact 540 
tggtactgaa cggctgttga cccctttccc gcaatttgac ggtcttgect tttaaaccga 600 
tgcaatctat tggtttaaag actagctacc aggtgcatgc ctgataactt tttaaggcaa 660 
cgaaaaatat tgetatttae cttaggaccg ttaggatcca ctagttctag agcggccgcc 720 
accgcggtgg agctccagct tttgttccct ttagtgaggg ttaattgege gcttggcgta 780 
atcatggtca t 791 

<210> 75 
<211> 1291 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Description of the artificial sequence: insert of 
vector pCB478-3 

<400> 75 

ttecaatttc cattcgccat tcaggctgcg caactgttgg gaagggcgat cggtgcgggc 60 
ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct gcaaggcgat taagttgggt 120 
aacgccaggg ttttcccagt cacgacgttg taaaacgacg gccagtgagc gcgcgtaata 180 
cgactcacta tagggcgaat tgggtaccgg gccccccctc gaggtcgacg gtatcgataa 240 
gcttgacccg acaaggaatt tcgctcgagt actccaaaac taatcaatat actttcgcat 300 
acaaattagt tcccagcggc tccagtgttg catcacttca tccgctagct cccattaagg 360 
agaggtccga ctatatctta tgagaagaat tttgatcctt actcgccagt ttcttcaaaa 420 
cgtttgatag gttcctggta gtttttctca ttgatacgga gtttttttgc taaatccaac 480 
actttctcga atccttcctg cgattctaaa tgtctacctg cttttaacag caagcagatt 540 
tttcgaaatc gaagaaattc aattcttctt tttgttttta gctcatgttt ttcgaaaaat 600 
ggtataatga cggttataaa gtgatcaata tttttgactt tatagcaaag ccgttcccca 660 
tgattcacaa cgactgatcc aactttaaag tagtctttta atgcatataa aacatttgca 720 
tctaatttat gttgggtcac tacgaattaa ttctggttga agctgatggc catagcgcaa 780 
cgaattattt ctaagtatac ttacactaaa acaaccttcc ccatctacga aacctgtaat 840 
ccattgagga rtaatgtcca tggtccctcc ctacaacgtc tgcagatcaa acatcttccc 900 
cgaccgacat ttagtctgtg aactgcatcc atatcaacag aagatctgtt gacttaggac 960 
ttggctgcgt ggttaggacc atgtccgtca gcrtattacc ataccctttg caaggccatc 1020 
tcaaagtttc ccctgagact tggtactgaa cggctgttga cccctttccc gcaatttgac 1080 
ggtcttgcct tttaaaccga tgcaatctat tggtttaaag actagctacc aggtgcatgc 1140 
ctgataactt tttaaggcaa cgaaaaatat tgctatttac cttaggaccg ttaggatcca 1200 
ctagttctag agcggccgcc accgcggtgg agctccagct tttgttccct ttagtgaggg 1260 
ttaattgcgc gcttggcgta atcatggtca t 1291 

<210> 76 
<211> 1386 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
vector pCB492-25 

<400> 76 

gcgcgcgtaa tacgactcac tatagggcga attgggggcg actgtttacc aaaaacacag 60 
gtctccgcaa agtcgtaaga ccatgtatgg gggctgacgc ctgcccagtg ccggaaggtc 120 
aaggaagttg gtgacctgat gacaggggag ccggcgaccg aagccccggt gaacggcggc 180 
cgtaactata acggtcctaa ggtaaatagc aatatttttc gttgccttaa aaagttatca 240 
ggcatgcacc tggtagctag tctttaaacc aatagattgc atcggtttaa aaggcaagac 300 
cgtcaaattg cgggaaaggg gtcaacagcc gttcagtacc aagtctcagg ggaaactttg 360 
agatggcctt gcaaagggta tggtaataag ctgacggaca tggtcctaac cacgcagcca 420 
agtcctaagt caacagatct tctgttgata tggatgcagt tcacagacta aatgtcggtc 480 
ggggaagatg tttgatctgc agacgttgta gggagggacc atggacatta atcctcaatg 540 
gattacaggt ttcgtagatg gggaaggttg ttttagtgta agtatactta gaaataattc 600 
gttgcgctat ggccatcagc ttcaaccaga attcgtagtg acccaacata aattagatgc 660 
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aaatgtttta tatgcattaa aagaclactt taaagttgga tcagtcgttg tgaatcatgg 720 
ggaacggctt tgctataaag tcaaaaatat tgatcacttt ataaccgtca ttataccatt 780 
tttcgaaaaa catgagctaa aaacaaaaag aagaattgaa tttcttcgat ttcgaaaaat 840 
ctgcttgctg ttaaaagcag gtagacattt agaatcgcag gaaggattcg agaaagtgtt 900 
ggatttagca aaaaaactcc gtatcaatga gaaaaactac caggaatcta tcaaacgttt 960 
tgaagaaact ggcgagtaag gatcaaaatt cttctcataa gatatagtcg gacctctcct 1020 
taatgggagc tagcggatga agtgatgcaa cactggagcc gctgggaact aatrtgtatg 1080 
cgaaagtata ttgattagtt ttggagtact cgagcgaaat tccttgtcgg gtaagttccg 1140 
acccgcacga aaggcgtaac gatctgggca ctgtctcgga gagaggctcg gtgaaataga 1200 
catgtctgtg aagatgcgga ctacctgcacctggacagag agaccccaat tcctggccta 1260 
gtctatagga ggttttgaaa agaaaggagc aataatcatt ttcttgttct atcaagaggg 1320 
tgctattgct cctttctttt tttctcgacc agcttttgtt ccctttagtg agggttaatt 1380 
gcgcgc 1386 

<210> 77 
<211> 60 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 77 

tagtaaatga caattttcct ctgaattata taattaacat ggcgactgtt taccaaaaac 60 

<210> 78 
<211> 31 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 78 

ctcaattggg gtctctctgt ccaggtgcag g 31 

<210> 79 
<2U> 1258 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: PCR product 
Prom-TetIVS2a-Cpa 

<400> 79 

tagtaaatga caattttcct ctgaattata taattaacat ggcgactgtt taccaaaaac 60 
acaggtctcc gcaaagtcgt aagaccatgt atgggggctg acgcctgccc agtgccggaa 120 
ggtcaaggaa gttggtgacc tgatgacagg ggagccggcg accgaagccc cggtgaacgg 180 
cggccgtaac tataacggtc ctaaggtaaa tagcaatatt tttcgttgcc ttaaaaagtt 240 
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atcaggcatg cacctggtag ctagtcttta aaccaataga ttgcatcggt ttaaaaggca 300 
agaccgtcaa attgcgggaa aggggtcaac agccgttcag taccaagtct caggggaaac 360 
tttgagatgg ccttgcaaag ggtatggtaa taagctgacg gacatggtcc taaccacgca 420 
gccaagtcct aagtcaacag atcttctgtt gatatggatg cagttcacag actaaatgtc 480 
ggtcggggaa gatgtttgat ctgcagacgt tgtagggagg gaccatggac attaatcctc 540 
aatggattac aggtttcgta gatggggaag gttgttttag tgtaagtata cttagaaata 600 
attcgttgcg ctatggccat cagcttcaac cagaattcgt agtgacccaa cataaattag 660 
atgcaaatgt tttatatgca ttaaaagact actttaaagt tggatcagtc gttgtgaatc 720 
atggggaacg gctttgctat aaagtcaaaa atattgatca ctttataaoc gtcattatac 780 
catttttcga aaaacatgag ctaaaaacaa aaagaagaat tgaatttctt cgatttcgaa 840 
aaatctgctt gctgttaaaa gcaggtagac atttagaatc gcaggaagga ttcgagaaag 900 
tgttggattt agcaaaaaaa ctccgtatca atgagaaaaa ctaccaggaa tctatcaaac 960 
gttttgaaga aactggcgag taaggatcaa aattcttctc ataagatata gtcggacctc 1020 
tccttaatgg gagctagcgg atgaagtgat gcaacactgg agccgctggg aactaatttg 1080 
tatgcgaaag tatattgatt agttttggag tactcgagcg aaattccttg tcgggtaagt 1140 
tccgacccgc acgaaaggcg taacgatctg ggcactgtct cggagagagg ctcggtgaaa 1200 
tagacatgtc tgtgaagatg cggactacct gcacctggac agagagaccc caattgag 1258 
<210>80 
<211> 4746 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
vector pCB435-45 

<400>80 

gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagtgatga 60 
cttgacggca tcctcacctt cctccggctt atcaccggca gtctgttcag ggttccaaac 120 
tcaacgatgg caactaaaca cgagggttgc gctcgttgcg ggacttaacc caacacctta 180 
cggcacgagc tgacgacagc catgcaccac ctgtgtccgc gttcccgaag gcacccctct 240 
ctttcaagag gattcgcggc atgtcaagccctggtaaggt tcttcgcttt gcatcgaatt 300 
aaaccacatg ctccaccgct tgtgcgggcc cccgtcaatt ccttttagtt tcattcttgc 360 
gaacgtactc cccaggcggg atacttaacg cgttagctac agcactgcac gggtcgatac 420 
gcacagcgcc tagtatccat cgtttacggc taggactact ggggtatcta atcccattcg 480 
ctcccctagc tttcgtctct cagtgtcagt gtcggcccag cagagtgctt tcgccgttgg 540 
tgttctttcc gatctctacg catttcaccg ctccaccgga aattccctct gcccctaccg 600 
tactccagct tggtagtttc caccgcctgt ccagggttga gccctgggat ttgacggcgg 660 
acttaaaaag ccacctacag acgctttacg cccaatcatt ccggataacg cttgcatcct 720 
ctgtattacc gcggctgctg gcacagagtt agccgatgct tattccccag ataccgtcat 780 
tgcttcttct ccgggaaaag aagttcacga cccgtgggcc ttctacctcc acgcggcatt 840 
gctccgtcag gctttcgccc attgcggaaa attccccact gctgcctccc gtaggagtct 900 
gggccgtgtc tcagtcccag tgtggctgat catcctctcg gaccagctac tgatcatcgc 960 
cttggtaagc tattgcctca ccaactagct aatcagacgc gagcccctcc tcgggcggat 1020 
tcctcctttt gctcctcagc ctacggggta ttagcagccg tttccagctg ttgttcccct 1080 
cccaagggca ggttcttacg cgttactcac ccgtccgcca ctggaaacac cacttcccgt 1140 
ccgacttgca tgtgttaagc atgccgccag cgttcatcct gagccaggat cgaactctcc 1200 
atgagattca tagttgcatt acttatagct tccttgttcg tagacaaagc ggartcggaa 1260 



CA 02470329 2004-06-14 



0817/00028 

36 



ttgtctttca ttccaaggca taacttgtat ccatgcgctt catattcgcc cggagttcgc 1320 
tcccagaaat atagccatccctgocccctc acgtcaatcc cacgagcctc ttatccattc 1380 
tcattgaacg acggcggggg agcgcgggcc ccccctcgac atatatttta aataatataa 1440 
agggggttcc aacatattaa tatatagtga agtgttccgg atccactagg acgttgtagg 1500 
gagggaccat ggacattaat cctcaatgga ttacaggttt cgtagatggg gaaggttgtt 1560 
ttagtgtaag tatacttaga aataattcgt tgcgctatgg ccatcagctt caaccagaat 1620 
tcgtagtgac ccaacataaa ttagatgcaa atgttttata tgcattaaaa gactacttta 1680 
aagttggatc agtcgttgtg aatcatgggg aacggctttg ctataaagtc aaaaatattg 1740 
atcactttat aaccgtcatt ataccatttt tcgaaaaaca tgagctaaaa acaaaaagaa 1800 
gaattgaatt tcttcgattt cgaaaaatct gcttgctgtt aaaagcaggt agacatttag 1860 
aatcgcagga aggattcgag aaagtgttgg atttagcaaa aaaactccgt atcaatgaga 1920 
aaaactacca ggaatctatc aaacgttttg aagaaactgg cgagtaactc gagaaaaaaa 1980 
gaaaggagca atagcaccct cttgatagaa caagaaaatg attattgctc ctttcttttc 2040 
aaaacctcct atagaciagg ccaggaatta tctgcagtta tttgccaact accttagtga 2100 
tctcgccttt cacgtagtgg acaaattctt ccaactgatc tgcgcgcgag gccaagcgat 2160 
cttcttcttg tccaagataa gcctgtctag cttcaagtat gacgggctga tactgggccg 2220 
gcaggcgctc cattgcccag tcggcagcga catccttcgg cgcgattttg ccggttactg 2280 
cgctgtacca aatgcgggac aacgtaagca ctacatttcg ctcatcgcca gcccagtcgg 2340 
gcggcgagtt ccatagcgtt aaggtttcat ttagcgcctc aaatagatcc tgttcaggaa 2400 
ccggatcaaa gagttcctcc gccgctggac ctaccaaggc aacgctatgt tctcttgctt 2460 
ttgtcagcaa gatagccaga tcaatgtcga tcgtggctgg ctcgaagata cctgcaagaa 2520 
tgtcattgcg ctgccattct ccaaattgca gttcgcgctt agctggataa cgccacggaa 2580 
tgatgtcgtc gtgcacaaca atggtgacttctacagcgcg gagaatctcg ctctctccag 2640 
gggaagccga agtttccaaa aggtcgttga tcaaagctcg ccgcgttgtt tcatcaagcc 2700 
ttacggtcac cgtaaccagc aaatcaatat cactgtgtgg cttcaggccg ccatccactg 2760 
cggagccgta caaatgtacg gccagcaacg tcggttcgag atggcgctcg atgacgccaa 2820 
ctacctctga tagttgagtt gatacttcgg cgataaccgc ttcacgagcc atgaatccct 2880 
ccctacaact agatcctcgc ccggagttcg ctcccagaaa tatagccatc cctgccccct 2940 
cacgtcaatc ccacgagcct cttatccatt ctcattgaac gacggcgtcg agggggggcc 3000 
cggtaccaaa tccaactaga aaaactcaca ttgggcttag ggataatcag gctcgaactg 3060 
atgacttcca ccacgtcaag gtgacactct accgctgagt tatatccctt ccccgcccca 3120 
tcgagaaata gaactgacta atcctaagtc aaagggtcga gaaactcaac gccactattc 3180 
ttgaacaact tggagccggg ccttcttttc gcactattac ggatatgaaa ataatggtca 3240 
aaatcggatt caattgtcaa ctgcccctat cggaaatagg attgactacc gattccgaag 3300 
gaactggagt tacatctctt ttccattcaa gagttcttat gcgtttccac gcccctttga 3360 
gaccccgaaa aatggacaaa ttccttttct taggaacaca tacaagattc gtcactacaa 3420 
aaaggataat ggtaacccta ccattaacta cttcatttat gaatttcata gtaatagaaa 3480 
tacatgtcct accgagacag aatttggaac ttgctatcct cttgcctagc aggcaaagat 3540 
ttacctccgt ggaaaggatg attcattcgg atcgacatga gagtccaact acattgccag 3600 
aatccatgtt gtatatttga aagaggttga cctccttgct tctctcatgg tacactcctc 3660 
ttcccgccga gccccttttc tcctcggtcc acagagacaa aatgtaggac tggtgccaac 3720 
aattcatcag actcactaag tcgggatcac taactaatac taatctaata taatagtcta 3780 
atatatctaa tataatagaa aatactaata taatagaaaa gaactgtctt ttctgtatac 3840 
tttccccggt tccgttgcta ccgagggcct cgtgatacgc ctatttttat aggttaatgt 3900 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 3960 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 4020 
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TZ^TI Ct !° ragCgg 'egtttgttt gccggatcaa gagctaccaa ctctttttcc 4080 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 4140 
gttaggocac cacttcaaga actctgtagc accgcctaca tacctcgctc tg^at^OO 
SS? 8 gC ? 8CtgCCa gtcgtgtctt accgggttgg at*caagacg 4260 

222? 8 ggCgC agC£gtCggg Ctg8a£ « ggg Xtcgjca cacagcLg 4320 
cttggagcga acgacctaca ccgaactgag atacctacag cgTgagctet gagaaagcgc 4380 

SSS 0 eaa888agaa a88Cggacag ****** mSX) 

HP 8 ?! agggagCttc ^gcctggtat c^Xgtc ^gtSgtt 4500 

tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggl ggagSf 4560 

fm gcc agcaacgcgg ""*** SSwcSSSS 

catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg T^tttgaSio 

4746 

<210> 81 
<211> 1213 
<212> DNA 

<213> Artificial sequenoe 
<220> 

<223> Description of the artificial sequence: probe for 
Southern blot analysis (16SrDNA) 

<400> 81 

£22? CagtCtgttC agggttccaa a ^cgat ggcaactaaa cacgagggtt 60 
gcgctcgttg cgggacttaa cccaacacct tacggcacga gctgacgaca eaatecacc 120 
acctgtgtcc gcgttcccga aggcacccct cte^^iwSSSSSm^ 
ccctggtaag gttcttcgct ttgcatcgaa ttaaaccaca 'SLJ?JSS!u0 
ccoccgtcaa ttccttttag tttcattctt gcgaacgtac tcxccaggcg ggafaSafsM 
cgcgttagct acagcactgc acgggtcgat acgcacagcg ccteg^fcc atcgtttacg360 
gctaggacta ctggggtatc taatcccatt cg^i^a,^^ 0 
gtgtcggccc agcagagtgc tttcgccgttggtgttottt ccgatctcta cgcattfcac 480 
cgctccaccg gaaattccct ctgcccctac cgtactccag cttggtagtt tccaccgcct540 
gtccagggtt gagccctggg atttgacggc ggacttaaaa agccacctac agacgcttta 600 
cgcccaatca ttccggataa cgcttgcatc ctctgtatta c*gcggctgc tggLSgag 660 
ttagccgatg cttattcccc agataccgtc attgcttctt ctccggfaaf agSS 720 
gaoccgtggg ccttctacct ccacgcggca ttgctccgtc ag^cgc ccattgcgga 780 
aaattcccca ctgctgcctc ccgtaggagt ctgggccgtg tcfcagtJc agtS 840 
atcatectct cggaccagct actgatcatc gccttggtaltTgctattg 8 ^ „S 
ctaatcagac gcgagcccct cctcgggcgg attectcctt ttgctectca gcctacgggg 960 
tattagcagc cgtttccagc tgttgttccc ctcccaaggg caggttctta cicgttactc 1020 
acccgtccgc cactggaaac accacttccc gtccgacttg catgtgttaa gcatgcxgcc 1080 
22 ;S tC S 83gCCagg atCgaaCtCt catalttgca tL^fllJo 

aTcca^gcS 8 ^ 8C88attC8g aattgtCttt Catt< Sf ^"S* 1200 

<210> 82 
<211> 949 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: probe for 
Southern blot <23SrDNA) 

<400>82 

catcgtcgag agggaaacag cccggatcac cagctaaggc ccctaaatga tcgctcagtg 60 
ataaaggagg taggggtgca gagacagcca ggaggtttgc ctagaagcag ccacccttga 120 
aagagtgcgt aatagctcac tgatcgagcg ctcttgcgcc gaagatgaac ggggctaagc 180 
gatctgccga agctgtggga tgtaaaaata catcggtagg ggagcgttcc gccttagaga 240 
gaagcctccg cgcgagcggt ggtggacgaa gcggaagcga gaatgtcggc ttgagtaacg 300 
caaacattgg tgagaatcca atgccccgaa aacctaaggg ttcctccgca aggttcgtcc 360 
acggagggtg agtcagggcc taagatcagg ccgaaaggcg tagtcgatgg acaacaggtg 420 
aatattcctg tactgcccct tgttggtccc gagggacgga ggaggctagg ttagccgaaa 480 
gatggttatc ggttcaagaa cgtaaggtgt ccctgctttg tcagggtaag aaggggtaga 540 
gaaaatgcct cgagccaatg ttcgaatacc aggcgctacg gcgctgaagt aacccatgcc 600 
atactcccag gaaaagctcg aacgactttg agcaagaggg tacctgtacc cgaaaccgac 660 
acaggtgggt aggtagagaa tacctagggg cgcgagacaa ctctctctaa ggaactcggc 720 
aaaatagccc cgtaacttcg ggagaagggg tgcctcctca caaagggggt cgcagtgacc 780 
aggcccgggc gactgtttac caaaaacaca ggtctccgca aagtcgtaag accatgtatg 840 
ggggctgacgcctgcccagt gccggaaggt caaggaagtt ggtgacctga tgacagggga 900 
gccggcgacc gaagccccgg tgaacggcgg ccgtaactat aacggtcct 949 

<210> 83 
<211> 1287 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
vector pCB456-2 

<400>83 

gagctctcag tactcgagac atttcaccct agaaatagac ttaaacttta ctggcttaac 60 
tttaaccttg gaccataaaa ggcaccccat tgcgagagtg cccttgatta accaaatgaa 120 
acgaagtcta accgaaagca gttatggcaa tggcagaatt ctgatcacgg aagatagctt 180 
tggcaaaaaa agcaaaaagc atttaccttg attgagatgt taattgtgtt ggcaattatc 240 
agtattttaa ttttgctttt tgtgccaaat ttgatactag agcttcgggt gccagggcgt 300 
gcccttgggc tccccgggcg cgtactcgac gctaccttaa gagagtcaag ctaattctaa 360 
tctgcagtct agcgtgcggc cgctctagaa ctagtggatc ccccgggctg cagttatttg 420 
ccaactacct tagtgatctc gcctttcacg tagtggacaa attcttccaa ctgatctgcg 480 
cgcgaggcca agcgatcttc ttcttgtcca agataagcct gtctagcttc aagtatgacg 540 
ggctgatact gggccggcag gcgctccatt gcccagtcgg cagcgacatc cttcggcgcg 600 
attttgccgg ttactgcgct gtaccaaatg cgggacaacg taagcactac atttcgctca 660 
tcgccagccc agtcgggcgg cgagttccat agcgttaagg tttcatttag cgcctcaaat 720 
agatcctgtt caggaaccgg atcaaagagt tcctccgccg ctggacctac caaggcaacg 780 
ctatgttctc ttgcttttgtcagcaagata gccagatcaa tgtcgatcgt ggctggctcg 840 
aagatacctg caagaatgtc attgcgctgc cattctccaa attgcagttc gcgcttagct 900 
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ggataacgcc acggaatgat gtcgtcgtgc acaacaatgg tgacttctac agcgcggaga 960 
atctcgctet ctccagggga agccgaagtt tccaaaaggt cgttgatcaa agctegSg? 1020 
gttgtttcat caagccttac ggtcaccgta accagcaaat caatatcact gtgtggcttc 1080 
aggccgocat ocactgcgga gccgtacaaa tgtacggcca gcaacgtcgg ttcgagatgg 1140 
cgctcgatga cgccaactac ctctgatagt tgagttgata cttcggcgat aaccgcU ifoo 
cgagccatga gatcctccag atccatgtat cattatagat aattgaagag tgaatgtcaa 1260 
gtcgacctcgagggggggcccggtacc ;L287 
<210> 84 
<211> 1387 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: insert of 
vector pCB528-2 

<400>84 

ggtaccccag ttaactggag atctcgagga ggtcacatgg gagcttggat tgaacaagat 60 
ggattgcacg caggttctcc ggccgcttgg gtggagaggc tattcgJL tfactS 120 

SSE? tCg8CtgCtC tg3tgCCgCC ^gcgca iggcf Sg !80 

gttctttttg tcaagaccga cctgtccggt gccctgaatg aactgcagga cglSgtaL, 240 
cggctatcgt ggctggccac gacgggcgtt ccttgcg^g ct^tglfcga 8 «£2 300 

caccttgctc ctgccgagaa agtatccatc atggctgatg caatgcgfcg ectecatac* 4?n 
cttgatccgg ctacctgccc attcgaccac cafgcgaaac atcfcSS Sg^ 480 

pcS? aa8< f ggtCt tgtCgatca g g^gg acgaagalcaTcfgIS 540 
gcgccagccg aactgttcgc caggctcaag gcgcgcatgc ccgacggcga ggatotcgtc 600 

ft£cr c ?i CgatgCCtg CttgCCgaat atcat ^ ^ccg ciggateo 
ttcatcgact gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggScc 720 
cgtgatattg ctgaagagct tggcggcgaa tgggctgacc grttcctcgl gaS 7 80 
atcgccgctc ccgattcgca gcgcatcgcc ttctatcgcc ttrttgacga gJcttcSo 
SS25 agS l CatCat ggCgCtC3CC tcttggctgt lattgaclgt 900 

^ggaagaaa cagtcggtca gtttccagtg ataacgcacc atgtaccttt agglggcL 960 
ctgcaaggaa cgctccattg ttacgagatc cccctagcag ctccttatgg gglgTc^oio 

£SS? g8CCtaCCCg CtggC3ataC aaac S* a «" tcaatclfgf cgfcSga 1080 
tgggggtccc acacagtccc ttttctatta gaaccggata ac^twacgg caaLcctgc 1140 
acagcatcgc acctatgtca taatactcga tgccacaatc ccttgcactt gtgctgggag 1200 

S acaaagg cagaaactgg ^^^JSSS^n 

gcggtggttt gtttaaggca gggtccgttg tacggcccag gggcgacfglYgcagScct 1320 

8CagtCaCtt tgtggtataa CtgCagaa e c «taa|c4toactglTl380 
ggagctc 13g7 
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We claim: 

1. A method for the integration of a DNA sequence into the 
5 plastid DNA of a multi-celled plant or cell derived therefrom 
and for the selection of predominantly homotransplastomic 
cells or plants, wherein 

a) the plastid DNA molecules of said multi-celled plant 
10 comprise at least one recognition sequence for the 

directed induction of DNA double -strand breaks and 

b) at least one enzyme suitable for the induction of DNA 
double-strand breaks at the recognition sequence for the 

15 directed induction of DNA double-strand breaks and at 

least one transformation construct comprising an 
insertion sequence are combined in at least one plastid 
of said multi-celled plant or cell derived therefrom, and 

20 

c) DNA double-strand breaks are induced at the recognition 
sequences for the directed induction of DNA double-strand 
breaks, and 

25 d) the insertion sequence inserts into the plastid DNA, the 
functionality of the recognition sequence for the 
directed induction of DNA double-strand breaks being 
deactivated so that said recognition sequence is no 
longer capable of being cleaved by the enzyme suitable 

30 for the induction of DNA double-strand breaks, and 

e) plants or cells in which the insertion sequence has been 
inserted into the plastid DNA molecules are isolated. 

35 2. A method according to claim 1, wherein the recognition 

sequence for the directed induction of DNA double-strand 
breaks and the insertion sequence are flanked at least 
unilaterally by sequences with sufficient length and 
sufficient homology to ensure homologous recombination with 

40 each other. 

3. A method according to either of claims 1 and 2, wherein the 
transformation construct encompasses at least one of the 
elements selected from the group consisting of 

45 
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expression cassette for an enzyme suitable for the 
induction of DMA double-strand breaks at the 
recognition sequence for the directed induction of DNA 
double-strand breaks 



ii) positive selection markers 

iii) negative selection markers 

iv) reporter genes 

v) replication origins 

vi) multiple cloning regions 

vii) border sequences for agrobacterium transfection 

viii) sequences which make possible a homologous 
recombination or insertion into the genome of a host 
organism. 

A method according to any of claims 1 to 3, wherein an enzyme 
suitable for the induction of DNA double-strand breaks at the 
recognition sequence for the directed induction of dna 
double-strand breaks is selected from the group consisting of 
restriction endonucleases, homing endonuc leases , chimeric 
nucleases, mutated restriction or homing endonucleases and 
RNA protein particles derived from mobile group II introns 
and fusion proteins of the abovementioned proteins with ' 
plastid localization sequences. 

A method according to any of claims 1 to 4, wherein the 
enzyme suitable for the induction of DNA double-strand breaks 
at the recognition sequence for the directed induction of DNA 
double-strand breaks is selected from the group of the homing 
endonucleases consisting of F-Scel, F-Scell, F-Suvl, F-TevI 
F-xevll, l-Amal, I-Anil, i-ceul, 1-CeuAllP, i-chul, l-Cmoel' 
I-Cpal, I-Cpall, I-Crel, I-Crepsbip, I-Crepsbllp, 
I-CrepsbIIIP, 1-CrepsbIVP, i-csml, I-Cvul, I-CvuAIP, I-Ddili 
I-Dirl, I-Dmol, 1-HspNlP, i-Llal, l-Msol, i-Naal, I-Nanl, ' 
I-NclIP, 1-NgrIP, I-NitI, I-Njal f I-Nsp236IP, I-PakI, 
I-PboIP, I-PcuIP, I-PcuAI, I-PcuVI, I-PgrlP, I-PobIP, l-Porl 
1-PorllP, 1-PpblP, i-Ppol, I-SPBetalP, l-scal f I-Scel, 
I-Scell, I-Scein, I-ScelV, 1-SceV, I-SceVI, I-SceVII, 
1-SexlP, l-sneip, I-SporaCP, 1-SpomlP, 1-SpomllP, 1-SqulP, 
I-Ssp6803l, I-SthPhiJP, I-SthPhiST3P, I-SthPhiS3bP, I-TdeIP 
I-Tevl, I-Tevii, i-Tevlli, 1-OarAP, 1-UarHGPAlP, 
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I-UarHGPA13P, I-VinIP, I-ZbiIP r Pl-MtuI, PI-MtuHIP, 
PI-MtuHIIP, Pl-Pful, Pl-Pfull, Pi-Pkol, Pl-Pkoll, Pl-Pspl, 
PI-Rma43812IP, PI-SPBetalP, Pl-Scel, PI-Tful, PI-TfuII, 
Pl-Thyl, PI-Tlil, PI-Tlill and fusion proteins of the above 
5 with plastid localization sequences. 

6. A method according to any of claims 1 to 5, wherein the 
enzyme suitable for the induction of DNA double-strand breaks 
at the recognition sequence for the directed induction of DNA 

10 double-strand breaks is selected from the group of the homing 
endonuc leases consisting of the enzymes as shown in SEQ ID 
NO: 5, 12, 14 and 71. 

7. A method according to any of claims 1 to 6, wherein the 

15 enzyme suitable for the induction of DNA double-strand breaks 
at the recognition sequence for the directed induction of DNA 
double-strand breaks is expressed using an expression 
cassette encompassing a nucleic acid sequence encoding said 
enzyme, 

20 

8. An expression cassette comprising an enzyme suitable for the 
induction of DNA double-strand breaks at the recognition 
sequence for the directed induction of DNA double-strand 
breaks as defined in any of claims 4 to 6 under the control 

25 of a promoter which is functional in plant plastids. 

9. An expression cassette comprising a fusion protein of a 
plastid translocation sequence and an enzyme suitable for the 
induction of DNA double-strand breaks at the recognition 

30 sequence for the directed induction of DNA double-strand 

breaks as defined in any of claims 4 to 6 under the control 
Of a promoter which is functional in the plant nucleus. 

10. A multi-celled plant obtained by a method according to any of 
35 claims 1 to 7. 

11. A multi-celled plant comprising an expression cassette 
according to claim 8 inserted into the plastid DNA. 

40 12. A multi-celled plant comprising an expression cassette 
according to claim 9 inserted into the nuclear DNA. 

13. A multi-celled plant according to either of claims 10 and 12 
selected from the group consisting of Arabidopsis thaliana, 
45 tobacco r Tagetes, wheat, rye, barley, oats, oilseed rape, 
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^inut! at °' SUgar S ° ybean ' sunflower ' P^Pkin/squash 



5 



14. a cell culture, organ, tissue, part cr transgenic propagation 
material derived from a multi-celled plant according to 
claims 10 fn n 3 



15. 

10 



15 



20 



claims 10 to 13. 

The use of an organism according to any of claims 10 to 13 or 
of cell cultures, organs, tissues, parts or transgenic 
propagation material derived therefrom according to claim 14 
as food, feed or seed or for the preparation of 
pharmaceuticals or fine chemicals. 

" * IJ"? !° n8trUCt COI »P*"ing *t least one nucleic acid sequence 
and intron sequence element which are capable of ensuring, in 
a ribonucleic acid sequence derived from said dna construct, 
the deletion of the ribonucleic acid fragment encoding said 
nucleic acid sequence, said nucleic acid sequence being 
heterologous with regard to said intron sequence elements. 

17. A DNA construct according to claim 16, wherein the nucleic 
acid sequence is flanked by at least one splice acceptor 
sequence and a splice donor sequence. 



25 18. 



30 



DNA constr «ct according to claim 16 or 17, wherein the DNA 
construct comprises, at the 5' and 3' end, sequences HI and 
H2, respectively, with a sufficient length and homology with 
plastid sequences HI ' and H2 • , respectively, to ensure 
homologous recombination between HI and HI', or H2 and H2' 
and thus insertion of the HI- and H2-flanked sequence into' 
the plastome. 



19. A transgenic plasid DNA comprising at least one nucleic acid 
sequence and intron sequence elements which are capable of 

35 ensuring, in a ribonucleic acid sequence derived from said 
transgenic plastid DNA construct, the deletion of the 
ribonucleic acid fragment encoding said nucleic acid 
sequence, said nucleic acid sequence being heterologous with 
regard to said intron sequence elements. 

40 

20. a transgenic plastid DNA according to claim 19, wherein the 
nucleic acid sequence is flanked by at least one splice 
acceptor sequence and a splice donor sequence. 

45 21. a transgenic plant comprising a transgenic plastid DNA 
according to claim 20. 
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5* Exon „ Intron 





cucucuAAAUAG 

1:111:111 

GAAAAGGGAGGUUUCCAUUU 

I I I I I I : 

GAGUACUCGuaagguag - 



Intron 3 * Exon 



5 : Exon „ Intron 



uccuaagguaagguAAAUAG 

11111:1 1 

GAAAAAUUCCGUUGCUUUUU 

1:11111 

GAGUACUCGagcgaaa — 




Intron 3 ' Exon 
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